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Chapter  One 
Introduction 


The  unifier  on  problems  are  a  class  of  problems  involving  a  general  form  of  pattern  matching.  As 
such,  they  occur  independently  in  many  context  that  involve  symbol  manipulation,  the  most 
pronounced  of  these  being  automated  theorem  proving.  A  unification  problem  can  be  formulated  as 
a  problem  on  stnngs  graphs  or  algebraic  objects  such  as  sets  or  groups. 

The  classical  unification  problem  is:  given  two  terms  containing  function  symbols  and  variables, 
find  a  uniform  replacement  of  terms  for  the  variables  that  makes  the  two  terms  syntactically  identical. 
Equationai  unification,  or  E  unification  extends  the  classical  problem  to  solving  an  equation  in  an 
equational  theory  That  is  given  a  set  of  equationai  axioms,  find  a  substitution  for  the  variables  in  the 
two  terms  that  makes  them  provably  equal  from  the  set  of  axioms  describing  the  theory. 

In  this  thesis  we  develop  a  framework  for  automatically  combining  E-unification  algorithms  for 
independent  sets  of  operators  by  carefully  analyzing  the  E  unification  process  in  general  and  study¬ 
ing  a  number  of  equationai  theories  in  particular.  Our  approach  is  generalized  in  the  following  sense: 
Given  a  unification  algorithm  for  E,,  and  a  unification  algorithm  for  E2,  we  can  in  some  cases 
automatically  generate  an  algorithm  for  the  combined  theory  of  E,  and  E2,  such  that  the  resulting 
algorithm  will  unify  terms  with  mixed  set  of  operators. 

Unification  was  first  described  by  Herbrand  in  1930,  and  was  first  put  to  practical  use  by  Robinson 
as  a  basic  step  in  resolution  [Robinson  65],  an  inference  rule  used  as  a  complete  proof  system  for  first 
order  predicate  calculus.  Because  of  its  simplicity  and  power,  the  resolution  rule  is  often  used  as  the 
basis  for  automatic  theorem  provers  and  is  also  exploited  in  implementing  the  logic  programming 
language  Prolog  [Kowalski  74,  Clocksin  81].  In  Prolog,  unification  acts  as  a  procedure  call 
mechanism,  allowing  procedures  to  be  invoked  when  the  arguments  fit  the  pattern  given  in  the 
procedure  head. 

Unification  is  also  used  in  type  inference  algorithms  for  languages  such  as  ML  [Milner  78],  in 
which  type  inference  is  used  as  a  compromise  between  strictly  typed  and  typeless  languages.  These 
languages  gain  expressive  power  over  explicitly  typed  languages  that  enforce  strong  type  checking, 
because  the  type  inference  provides  the  programmer  with  a  mechanism  for  a  certain  kind  of  polymor¬ 
phism  [Mitchell  84], 
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Unification  is  also  an  important  operation  in  term  rewriting  systems  These  systems  perform 
reasoning  by  compiling  equations  into  a  set  of  rewrite  rules.  This  compilation,  known  as  the  comple¬ 
tion  process,  involves  ordering  each  equation  into  a  directed  rule,  finding  pairs  of  rules  that  could 
apply  to  a  single  term,  and  sometimes  adding  new  rules  when  such  critical  pairs  are  found.  Unifica¬ 
tion  is  used  in  finding  the  pairs  of  overlapping  rules  and  in  generating  the  additional  rules.  If  the 
completion  process  terminates  successfully,  the  resulting  system  is  called  convergent,  meaning  the 
rules,  applied  in  any  order  to  a  given  input,  will  always  result  in  a  unique  answer  [Knuth  70].  A 
convergent  rewriting  system  is  a  complete  and  terminating  decision  procedure  for  determining 
whether  or  not  an  equation  is  implied  by  the  original  set  of  equations.  Term  rewriting  systems  can  be 
used  as  a  basis  for  automatic  theorem  provers  [Huet  82,  Kapur  84,  Goguen  80,  Hsiang  82],  These 
theorem  provers  have  been  used  for  applications  such  as  checking  formal  specifications  [Goguen 
79,  Guttag  83,  Kownacki  84],  interpreting  logic  programming  languages  [Dershowitz  83a,  Fribourg 

84] ,  reasoning  about  relational  databases  [Cosmadakis  85],  and  checking  properties  of  petri  nets 
[Choppy  85]. 

The  unification  algorithm  described  in  this  thesis  was  initially  motivated  by  the  need  to  extend  the 
domain  of  applications  for  which  the  REVE  term  rewriting  system  generator  [Lescanne  83,  Forgaard 
84a]  is  useful.  Because  unification  problems  occur  in  many  different  applications,  there  are  both 
theoretical  and  pragmatic  reasons  for  developing  a  better  understanding  of  the  problem  and  its 
solutions.  A  condensed  version  of  the  work  described  in  this  thesis  appeared  previously  in  [Yelick 

85] , 

1.1  Organization  of  the  Thesis 

The  remainder  of  this  chapter  is  devoted  to  giving  the  background  necessary  for  understanding 
unification,  both  classical  and  equational,  and  discussing  related  work  in  the  field.  The  rest  of  the 
thesis  is  divided  into  three  chapters.  Chapter  2  presents  our  generalization  of  the  problem  along  with 
some  some  restrictions,  gives  our  algorithm,  and  goes  through  a  non  trivial  example.  Chapter 
3  presents  a  proof  of  total  correctness  for  the  generalized  algorithm.  Chapter  4  presents  a  summary 
of  our  conclusions,  a  description  of  the  implementation,  and  ideas  for  future  work.  The  reader  can 
get  of  clear  picture  of  our  approach,  ignoring  the  question  of  why  the  algorithm  works,  by  reading 
Chapters  2  and  4.  Three  appendices  are  included.  Appendix  A  gives  some  additional  technical 
details  of  the  proof  of  correctness,  Appendix  B  gives  a  glossary  of  some  terms  used  in  this  thesis,  and 
Appendix  C  gives  a  list  of  special  symbols  and  there  uses. 


1 .2  Definitions 


The  following  definitions  are  consistent  with  the  definitions  of  [Fages  84]  and  [Huet  80a].  We 
begin  with  basic  definitions  of  terms  and  functions  on  terms. 

Let  V  be  a  countable  set  of  variables  and  F  be  a  family  of  function  symbols  with  associated  arity 
such  that  V  and  F  are  disjoint.  We  recursively  define  the  set  of  terms,  T(F,  V),  as  either  a  variable  or  a 
function  symbol  of  arity  n,  followed  by  n  terms.  We  assume  the  sets  V  and  F  to  be  fixed  and,  thus,  use 
T  in  place  of  T(F,  V)  without  ambiguity.  Function  symbols  of  arity  zero,  called  constants,  will  be 
denoted  by  the  letters  a  b.  c,  J ,  and  numerals  0.  I ,  to  be  distinguished  from  variables,  denoted  by  the 
letters  u.  v.  >v,  x.  y,  z.  For  readability,  we  will  use  the  symbols,  + ,  *,  and  •  as  binary  infix  operators. 
Examples  of  terms  are  f{x,  a),  h(x  •  0),andy  +  /. 

Given  a  term,  t,  let  f[t)  be  the  set  of  variables  in  t  and  5(f)  be  the  set  of  function  symbols  in  f.  The 
root  symbol  of  the  graph  representation  of  a  term,  f,  will  be  denoted  t.head — t.head  is  a  variable  if  f  is 
a  variable,  and  a  function  symbol  if  t  is  not  a  variable.  Terms  formed  from  function  symbols  alone,  i.e., 
containing  no  variables,  are  called  ground  terms  and  are  denoted  by  G. 

An  occurrence  in  a  term  names  a  node  within  the  tree  structure  of  the  term;  occurrences  are 
represented  by  strings  of  integers,  including  the  empty  string,  e.  The  set  of  occurrences  of  a  term, 
denoted  0(f),  is  defined  as  follows: 

1.  If  f  is  a  variable  or  constant,  then  0(f)  =  {«}. 

2.  If  f  =/(t1 . fn),  then  0(f)  =  {£}U{i.o  1 1  s  i  <  n  &  oEO^)} 

An  occurrence  can  be  used  to  index  into  a  term  as  follows: 

1.  t/e  =  t 

2. /(f, fn)/i.o  =  f/0 

An  occurrence  is  said  to  be  proper  if  it  is  not  the  empty  occurrence  and  strict  if  the  subterm  at  that 
occurrence  is  not  a  variable. 

A  substitution  is  a  mapping  from  variables  to  terms,  extended  to  an  endomorphism  (a  homomor¬ 
phism  from  a  set  to  itself)  on  terms.  I.e.,  if  a  is  a  substitution  then  of(ty...,tJ  =  /(af1,...,afn).  A 
substitution  will  be  denoted  by  a  set  of  variable  to  term  mappings,  {v  1*-f1, v2<-f2,...},  where  all  variables 
outside  the  set  are  implicitly  mapped  to  themselves.  The  identity  substitution,  i.e.,  the  substitution 
mapping  every  variable  to  itself,  will  be  written  t.  The  universe  of  substitutions  will  be  denoted  S.  We 
define  the  domain ,  5,  of  a  substitution,  o ,  as  follows:  9(a)  =  {v  |  ov  *  v}.  Note  that  this  differs  from 
the  usual  notion  of  a  function's  domain,  since  it  contains  only  those  variables  that  are  not  mapped  to 
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the  '.scl'. -5  The  woe.  a.  of  a  substitution  is  defined  as  1(a)  =  U  {ar},  and  the  range 

i  €3(<j) 

i  j-  u:?.v.v. as  J(a)  =  U  A  substitution,  a,  can  be  restricted  to  a  set  of  variables,  V,  written 

o|  .  b/  mapping  all  variables  outside  V  to  themselves,  o|v  =  { v  \  |  v€V}. 

Thus,  if  a  =  (x  *\i.w  \  *-/(-')}.  then  o/(v)  =  /(,;),  o.x;(--.v)  =  tf(-r  •/(*))  and  «rfjwj  =  {»•-<;}.  The 
domain,  range,  and  range  variables  of  a  have  the  following  values:  3(a)  =  {.*,>  },  3t>(a)  =  {a./(z)}, 
and  3(a)  =  {r}. 

A  term.  t.  is  said  to  be  an  instance  o l  a  term,  s.  if  and  only  if  there  exists  a  substitution,  a,  such  that 
f  =  as.  When  the  domain  of  o  is  restricted  to  the  variables  in  3(s).  o  is  unique  and  is  called  the  match 
of  t  by  s.  Substitutions  may  be  composed  using  functional  composition,  i.e.,  for  any  term,  t, 
(o^°o2)  t  =  o^oj)  in  the  same  sense  that  one  term  may  be  an  instance  of  another,  a  substitution, 
a  ,  is  an  instance  of  a  substitution,  a2.  if  and  only  if  there  exists  a  third  substitution,  t,  such  that  a1  = 
r°o2:  in  this  case  a2  is  said  to  be  more  general  than  oy  We  denote  this  partial  ordering  by  a1  5  «r2. 


1.3  Classical  Unification 

This  section  presents  the  classical  unification  problem  and  provides  an  example  of  a  simple 
algorithm  for  solving  the  problem.  This  classical  algorithm  will  serve  as  a  framework  for  our  general¬ 
ized  algorithm 

Definition  Given  two  terms,  t  and  s,  a  substitution,  o,  unifies  1  and  s  if  and  only  if: 

of  'JS, 


in  general,  there  is  more  than  one  substitution  that  will  unify  two  given  terms.  For  example,  if  f  = 
/(*(*), ))  and  s  =  /(v.z)thena  =  ).  * «-«(£(* ))}  is  a  unifier,  as  well  as  o  =  {.i  *-g ($(*)), 

z*-  j:(g(}!(H ))),  .v  *•&’(«)},  and  an  infinite  number  of  other  substitutions.  Observe  in  the  preceding 
example  that  o  is  instance  of  o.  o'  =  {.*«-  g  (  v)}°o.  In  fact,  all  unifiers  of  f  and  s  can  be  written  as 
some  substitution  composed  with  a,  so  we  call  a  the  most  general  unifier.  The  most  general  unifier  of 
two  terms  is  analogous  to  the  least  common  multiple  of  two  natural  numbers;  every  multiple  of  two 
numbers  is  divisible  by  the  least  common  multiple,  just  as  every  unifier  of  two  terms  is  an  instance  of 
the  most  general  unifier. 

Definition.  The  most  general  unifier  of  two  terms  f  and  s,  is  a  unifier,  a,  such  that  V  <p 

<pt  =  tps  =>  (3r,  ip  =  r°o). 

In  classical  unification,  there  is  at  most  one  most  general  unifier  up  to  variable  renaming.  A  simple 
recursive  unification  algorithm  based  on  (Robinson  71]  is  given  below. 
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unify  =  proc  (t:  term,  s:  term)  returns  (substitution) 
case 

is_variable(f)  and  is_variable(s)  =» 
return({f«-s}) 
is_variable(f)  => 
iff€n«) 

then  failure:  cycle 
else  return({f  «-s}) 
is_variable(s)  => 
itserio 

then  failure:  cycle 
else  return({s  ♦•/}) 
t.head^s.head 
failure:  clash 
t.'iead  =  s.head  => 
o  :=  i 

for  i  -  1  tr  arity(t.head)  do 
a  :  =  unify(at/i,  as/i)°o 

end 

return(a) 

end 
end  unify 


Figure  1-1:  A  Classical  Unification  Algorithm 


The  algorithm  points  out  two  cases  in  which  it  is  not  possible  to  unify  the  input  terms.  If,  at  any 
point  in  the  unification  process,  it  is  necessary  to  unify  two  terms  in  which  the  head  operator  symbols 
are  not  the  same,  a  clash  occurs  and  the  terms  are  not  unifiable.  The  second  case  is  called  a  cycle 
and  is  succinctly  shown  by  trying  to  unify  the  terms  x  and  f(x).  The  unifier,  {x  (■■■))))}  is 

infinite,  although  it  can  be  finitely  represented  by  a  cyclic  graph.  In  some  applications,  infinite  unifiers 
are  allowed  or  even  desired  [Filgueiras  82];  in  many  other  applications  infinite  unifiers  would  lead  to 
non  terminating  program  behavior.  We  will  include  the  cyclicity  test  to  disallow  infinite  unifiers,  but 
note  that  a  unification  algorithm  can  be  easily  modified  to  allow  infinite  unifiers  by  removing  the 
cyclicity  test. 


1 .4  Equational  Unification 


As  suggested  at  the  outset,  we  will  use  the  word  "unification"  to  stand  for  not  the  single  problem 
of  section  1 .3,  but  for  a  class  of  problems  which  differ  from  classical  unification  according  to  the 
desired  notion  of  equality.  In  particular,  we  will  be  using  equality  defined  by  an  equational  theory, 
although  we  could  also  imagine  using  non  equational  or  even  higher  order  logics.  The  equational 
unification  problems  use  a  form  of  equality  that  is  weaker  than  syntactic  equality  and  are  therefore 
relevant  to  applications  in  which  a  less  rigid  matching  process  is  needed. 

1.4.1  Applications 

Until  recently,  most  applications  made  use  only  of  classical  unification,  however,  the  need  for 
equational  unification  is  clear.  A  number  of  operators  that  occur  frequently  in  practice  have 
properties  described  by  equational  theories,  and  equational  unification  provides  at  least  one  tech¬ 
nique  for  reasoning  about  operators  with  these  properties.  Furthermore,  other  common  reasoning 
techniques,  such  as  resolution  and  term  rewriting,  do  not  handle  a  number  of  useful  equational 
properties  that  can  be  handled  by  incorporating  equational  unification. 

For  example,  the  properties  of  associativity  and  commutativity,  called  the  AC-theory,  can  be 
described  by  the  equational  axioms: 

^.x•^y•z)  =  (x*y)’z 
2.  x‘y  =ymx. 

The  integer  operations  of  plus  and  times  are  only  two  of  the  many  examples  of  associative  and 
commutative  functions  about  which  we  would  like  to  be  able  to  reason  automatically.  Despite  the 
prevalence  of  AC  operators,  basic  term  rewriting  systems  and  resolution  systems  run  into  difficulty 
handling  this  theory. 

Another  example  of  an  interesting  equational  theory,  less  familiar  to  the  mathematician,  occurs  in 
data  type  specifications  for  sets;  the  insert  operation  is  "commutative"  in  the  sense  that  the  order  of 
inserting  elements  into  a  set  is  not  important.  This  property  can  be  axiomatized  by  the  equation: 
msert(insert(s,  e^),  e2)  =  insert(insert{s,  e2),  ej,  which  is,  again,  a  problematical  axiom  for  term 
rewriting  and  resolution  based  systems. 

The  difficulty  with  both  the  AC-theory  and  the  insert  operation  comes  from  the  symmetry  of  the 
axioms,  which  allows  them  to  be  used  repetitively.  A  simple  example  using  the  commutative  axioms  in 
a  Prolog  program  will  exemplify  the  problem.  Consider  the  Prolog  program  in  Figure  1-2,  which 
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corta  ns  t'v-  user  defined  s  i'U-'g  relation  The  first  three  lines  are  input  by  the  programmer:  line  one 
asserts  that  ve  and  mary  are  s-bi-ngs.  line  two  asserts  that  the  sibling  relation  is  commutative,  and 
line  three  is  a  query,  asking  for  the  siblings  of  mary.  In  response  to  the  query,  the  Prolog  interpreter 
returns  the  value  /oe  for  X,  since  joe  was  asserted  to  be  a  sibling  of  mary.  When  the  interpreter  is 
promoted  for  any  other  sibi-^gs  of  mary.  the  interpreter  again  returns /oe.  The  program  will  continue 
to  loop  in  this  manner,  and  in  general  there  is  no  way  to  determine  that  all  distinct  answers  have  been 
found  so  that  the  process  can  stop.  For  similar  reasons,  the  associative  and  commutative  axioms  in  a 
term  rewriting  system  cannot  be  oriented  into  rewrite  rules  without  losing  the  termination  property  of 
the  system. 


siblingfmary,  joe). 
siblingfX,  Y) sib!ing(Y,  X). 
?  siblmgfmary,  X). 

X  =  joe; 

X  =  joe; 

X  =  joe; 


Figu  re  1  -2:  A  Prolog  Program  with  Commutativity 


In  both  resolution  and  term  rewriting,  a  solution  to  the  problem  is  to  build  the  symmetric  axioms 
into  the  system,  i.e..  into  the  unification  process,  so  that  the  axioms  are  not  explicd'y  needed.  [Plotkin 
72]  describes  the  extension  of  resolution  to  resolution  with  equational  unification  and  [Peterson 
81 ,  Dershowitz  83b.  Jouannaud  84]  describe  extensions  of  term  rewriting  systems  to  equational  term 
rewriting  systems.  Resolution  and  term  rewriting  systems  are  two  of  the  current  uses  of  equational 
unification,  but  applications  are  by  no  means  limited  to  these  two.  A  review  of  some  ideas  for  using 
equational  unification  is  given  in  [Siekmann  84]. 
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1 .4.2  Equational  Theories 

The  equational  unification  problem  will  be  defined  in  Section  1.4.3.  For  that  purpose,  we  will  need 
an  understanding  of  equational  theories  and  some  related  definitions.  This  section  gives  the  neces¬ 
sary  formal  background.  There  are  two  approaches  to  presenting  equational  theories:  proof  theoretic 
and  semantic.  The  proof  theoretic  approach,  based  on  syntactic  inference  rules,  is  presented  first 
and  the  semantic,  or  algebraic  approach,  is  presented  second.  The  key  result  in  the  study  in  equa¬ 
tional  logic  is  the  work  of  Birkhoff,  who  proved  that  the  two  characterizations  of  equational  theories 
are  equivalent  [Birkhoff  35,  Gratzer  78], 

An  equation  is  a  pair  of  terms,  f  =  s.  A  congruence  relation  is  an  equivalence  relation,  closed 
under  the  equality  rule : 

t.  ~  1  <  i  s  n  =>  /(/1 tn )  ~/(s1 sn)  for  all / £F  of  arity  n. 

Given  a  set  of  equations  E,  the  equational  theory  presented  by  E  is  the  set  of  equations  E*  formed  by 
the  finest  congruence  over  T  that  contains  E  and  is  closed  over  instantiation.  We  will  denote  this 
congruence  relation  by  t  =s,  meaning  t  =s  €  E*.  E*  is  exactly  the  set  of  equations  derivable  from  E  by 
a  finite  proof,  using  the  following  inference  rules  given  by  Birkhoff: 

1.  Reflexity:  x  =x  is  alway  an  axiom. 

2.  Symmetry:  From  t  |S  deduce  $  1 1. 

3.  Transitivity:  From  t  =r  and  r  =s  deduce  t  =s. 

4.  Equality:  From  /.  =sjt  1  S  i  S  n  deduce /(^ i'n;  =/’(s1„..,sn),/ of  arity  n. 

5.  Instantiation :  From  t  =s  deduce  ol  ~os. 

E  E 

We  will  consistently  use  E  and  E*,  respectively,  as  a  set  of  axioms  and  the  equational  theory 
presented  by  those  axioms.  Note  that  E*  is  uniquely  defined  from  E,  but  there  may  be  more  than  one 
presentation,  E,  given  a  theory  E*.  Even  if  E  is  an  irredundant  presentation  it  may  not  be  unique;  for 
example,  group  theory  has  a  number  of  distinct  irredundant  presentations.  This  fact  will,  in  some 
cases,  force  us  to  fix  the  presentations  of  theories  so  that  proof  theoretical  arguments  can  be  based 
on  a  well  defined  set  of  axioms. 

The  following  discussion  of  algebras  is  needed  for  giving  the  semantics  of  equational  theories.  An 
algebra ,  .A,  is  a  pair  (A,  F),  where  A  is  a  set  of  elements  called  the  carrier  of  A  and  each  f  6  P  is  a 
function  from  An  to  A  for  some  arity,  n.  A  mapping,  i>,  from  V  to  A,  (i.e.,  to  the  carrier  of  A)  extended 
as  a  homomorphism  from  T  to  A,  is  called  an  ^.-assignment.  It  is  important  in  this  discussion  to 
distinguish  between  the  semantic  and  syntactic  objects.  For  example,  if  A  is  the  set  of  integers  and  f 


a  set  of  integer  operations,  then  addition  is  a  function  in  P  and  the  number  "one"  is  an  element  of  A 
whereas  "  +  "  and  "1 "  are  syntactic  objects  for  which  we  may  choose  any  interpretation. 


One  of  the  simplest  models  that  exists  for  any  equational  theory  is  the  term  algebra ,  /  The  term 
algebra  has  as  its  carrier  the  set  of  terms,  T  =  T(F,  V),  and  as  its  set  of  functions  a  set  of  term 
constructors,  F,  one  for  each  function  symbol  in  F.  For  example,  if  /  is  a  unary  operator  in  F,  then 
there  exists  a  corresponding  unary  func*;on,  f  in  F,  such  that  f  maps  any  term,  t,  in  T  to  the  term  /(f). 
Because  the  carrier  of  /is  exactly  the  set  of  terms,  the  identity  map  is  one  example  of  a  '/•assignment. 

If  vt  =  vs  for  all  ^  assignments,  v ,  then  A  is  called  a  model  of  the  equation  t  =  s,  and  t  =  s  is  said 
to  be  valid  in  A;  we  denote  this  condition  by  A  N  /  =  s.  Validity  can  be  extended  to  a  set  nf 
equations  by:  A  t=  E  if  and  only  if  (f  =  s)  €  E  =>  A  N(f  =  s).  Given  a  set  of  equations,  E,  we  denote 
the  class  of  all  models  of  E  by  Jt(E)  and  the  set  of  equations  valid  in  a  class  of  models,  M,  by  Eq  M. 
Given  a  set  of  axioms,  E,  we  can  semantically  define  an  equational  theory  as  the  set  of  equations  valid 
in  all  models  of  E.  To  reiterate  the  equivalence  between  the  algebraic  and  proof  theoretic  charac¬ 
terizations  of  equational  theories,  note  that  the  soundness  and  completeness  of  the  above  inference 
rules,  as  proved  by  Birkhoff,  can  be  written  E  *  =  Eq  Jt>(E). 

If  there  exists  a  non  trivial  model  of  E,  i.e.,  U  =  (A,  f),  and  ./NE  and  |A|  >  1,  then  the  theory 
presented  by  E  is  said  to  be  strictly  consistent.  Syntactically,  an  equational  theory  has  only  the  trivial 
model  if  and  only  if  jt  =  y,  since  any  equation  is  a  substitution  instance  of  this  one.  The  unification 
problem  in  an  inconsistent  theory  is  always  trivial.  By  assumption,  we  will  work  with  only  strictly 
consistent  equational  theories. 


The  equivalence  relation  on  terms  defined  by  an  equational  theory  can  be  extended  to  an  equiv¬ 
alence  relation  on  substitutions:  e1  |  o2  if  and  only  if  V  v  €  V  (^v  =  a2v).  In  many  cases  we  are 
interested  only  in  the  effect  of  a  substitution  on  a  particular  set  of  variables,  V.  We  extend  our  defini- 

v 

tion  as  follows:  o1  =  a2  if  and  only  if  V  v  €  V  =  o2v.  Furthermore,  we  say  that  o1  is  more  general 
than  o2  modulo  E  over  V,  written  a1  if  a2,  if  and  only  if: 


There  exists  t  such  that  r°a,  = 

1  E  2 

y  v  v 

The  equivalence  relation  defined  by  o.  s  and  <r0  s  a,  will  be  denoted  ff„=cr„  or,  if  V  =  V,  by  o,son. 

'E*  ^E1  1 E  2 

The  relation  =  corresponds  to  our  intuitive  notion  of  equivalent  modulo  E  up  to  variable  renaming, 
v  ^ 

and  =  corresponds  to  same  relation  where  only  the  domain  of  V  is  considered. 


The  following  properties  hold  for  the  equivalence  relation  on  substitutions.  They  will  be  used 
freely  in  the  proof  of  correctness  in  Chapter  3  and  are  presented  here  to  avoid  distraction  during 
those  proofs  and  to  aid  the  reader  in  developing  intuition  about  =  on  substitutions. 


instantiation 


1 .0=0,.=*  a  “a.  =o,  °a„.  for  any  a, 

1  c  3  t  c  3  l  4 


2.  (<V02,o°3f  V(o2°ff3> 


3.  <t,  =<r2  <=>  <r|v  |ff|v 


4.  o  =c  ,  1  S i S  n  =»  o,°. .  °a  =  T,°— 
i£Ti  1  n  £  T  i  T  n 


associativity 

restriction 

equality  lor  composition 


5.  <T10<r2=<r2°<r1.if  (^(a^nJfaj)  =  0)  & 


limited  commutativity 


(cMo2)D3t<j,)  =  0)S(r6fJ(<71)na(<72))=><71>'=ff2»>) 


Having  presented  the  basic  concepts  involved  in  working  with  equational  theories,  we  are  now 
ready  to  examine  the  unification  problem  in  these  theories. 


1.4.3  A  Problem  Statement 

Equational  unification  is  the  problem  of  solving  an  equation  of  the  form  t  =  s  in  the  quotient 
algebra.  €J/  =  .  whose  carrier  is  the  set  of  congruence  classes  of  terms  defined  by  E.  If  E*  is  the  empty 
theory,  then  we  again  have  the  classical  unification  problem.  The  problem  is  distinguished  from  the 
problem  of  safshaC.i  ty  in  a  first  order  theory.  In  unification  the  interpretation  of  symbols  is  fixed  as 
the  term  algebra  interpretation,  whereas  determining  satisfiability  of  a  first  order  statement  is  the 
problem  of  finding  whether  there  exists  any  interpretation  in  which  the  statement  is  valid. 

Definition.  Let  t  and  s  be  terms  and  E  be  a  set  of  equations  A  substitution  o.  is  said  to  be  an 
E-unifier  of  t  and  s  if  and  only  if: 

of  =  os. 

E 


1.4.3. 1  Properties  of  Unification  Algorithms 

Let  U£  denote  the  set  of  all  E-unifiers  of  terms  f  and  s,  i.e.,  U£(t,  s)  =  {o  €  S  |  of  |  o  s}  and  let 
V  =  TffJU'tls).  As  in  classical  unification,  U£  is  infinite;  we  represent  it  by  a  complete  set  of  unifiers 
from  which  set  the  U£  can  exactly  be  generated  by  considering  all  instances  of  each  substitution  in 
the  set.  If  every  element  of  a  set  of  unifiers  is  necessary  for  completeness,  it  is  called  a  minimal 
complete  set  of  unifiers.  The  following  set  of  definitions  formalize  these  concepts. 

Definition.  Let  2  be  a  set  of  unifiers  of  f  and  s  and  V  =  T\t)U1Ts).  2  is  said  to  be  como'ete  if  and 
only  if  it  generates  all  unifiers: 

VoCUcff,  s)  3  o'€2  o  'io 
£  E 

Definition.  Let  2  be  a  set  of  unifiers  of  f  and  s  and  V  =  flOUtls).  2  is  minimal  if  and  only  if  no 
substitution  in  2  is  redundant: 


*  •„>  ^  m  *  If*  ’*  ”  *  -*■  '  '  .  *  -  -  •  rw  .  rvii..n.  -l  m 


Vo,a'€2  o  $  o' 

E 

When  it  exists,  a  minimal  complete  set  of  unifiers  is  unique  up  to  s  for  any  E*  [Fages  84],  The  size 
of  the  minimal  complete  set  is  bounded  for  certain  values  of  E.  If  E  =  0,  there  is  always  a  singleton 
complete  set  for  any  two  unifiable  terms.  If  E  contains  only  the  associative  and  commutative  axioms 
(the  AC  theory)  then  the  complete  set  is  always  finite.  If  E  contains  only  the  associative  axiom,  then 
there  are  some  pairs  of  terms  for  which  every  complete  set  of  unifiers  is  infinite.  If  there  is  a  finite 
complete  set  then  a  minimal  complete  set  always  exists  and  can  be  found  by  filtering  out  non-minimal 
unifiers  through  matching.  For  some  infinite  cases,  the  properties  of  minimality  and  completeness 
may  conflict,  so  that  no  minimal  and  complete  set  exists  [Fages  83a]. 

For  completeness,  it  may  be  necessary  for  an  E-unification  algorithm  to  use  more  variables  in  the 
range  of  the  unifiers  than  occur  in  the  terms  being  unified.  Because  unification  procedures  are  often 
used  within  a  larger  system  containing  variables  of  its  own,  it  is  useful  to  require  an  additional 
property  to  protect  the  existing  variables  from  being  used  as  new  variables. 

Definition.  Let  2  be  a  set  of  unifiers  of  t  and  s,  V  =  f[f)UfIs),  and  let  W  be  some  set  of  variables  to 
be  protected,  where  VCW.  2  is  protective  if  and  only  if: 

Va€2  3(o)CV  &  W-Vn5(o)  =  0 

&  3(o)n3(o)  =  0. 

Without  loss  of  generality,  we  will  assume  sets  of  unifiers  are  protective,  both  for  the  pragmatic 
reason  given  above  and  for  the  technical  reason  that  it  makes  unifiers  idempotent,  (i.e., 
9(a)n3(o)  =  0  =»  o°o  =  a)  which  will  be  used  in  the  proofs. 

The  properties  on  sets  of  substitutions  are  extended  to  properties  on  a  unification  procedure; 
collectively,  they  constitute  partial  correctness  of  a  procedure. 

Definition.  A  procedure,  E-unify  is  a  partially  correct  unification  procedure  for  E*  if  and  only  if  for  all 
terms  t  and  s  and  any  finite  set  of  variables  WDT(f)UT[s),  if  E-unify  terminates  with  a  set  of  substitu¬ 
tions  2,  then: 

1.  consistency:  a€2  =>  af  |OS. 

2.  completeness:  2  is  complete  for  t  and  s. 

3.  protection:  2  is  protective  of  t,  s,  and  W. 

If,  in  addition  to  being  partially  correct,  E-unify  returns  only  minimal  sets  of  unifiers,  then  it  is  said 
to  be  a  minimal  procedure.  A  procedure  which  is  partially  correct  and  terminating  is  called  totally 
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correct  and  is  referred  to  as  an  algorithm  rather  than  a  procedure.  Any  theory  with  a  terminating 
unification  algorithm  has  a  minimal  complete  set  of  unifiers  for  any  pair  of  terms.  However,  minimality 
of  the  algorithms  is  not  included  in  the  correctness  criteria  because  it  is  a  difficult  to  guarantee 
without  the  costly  filtering  process,  and  because  it  is  not  necessary  in  most  applications  of  E- 
unification. 

1 .4.3.2  Classifying  Solutions  to  the  Unification  Problem 

Historically,  there  are  a  number  of  ways  in  which  one  can  solve  the  unification  problem  for  a 
particular  theory,  E*.  We  will  classify  each  kind  of  solution  because  it  will  help  to  clarify  the  contribu¬ 
tions  of  our  approach  and  the  assumptions  behind  it.  The  terminology  used  here  is  not  well-defined 
in  the  literature,  but  we  establish  a  convention  based  on  common  usage  for  referring  to  each  kind  of 
solution  after  giving  its  characterization. 

1.  For  a  given  equational  theory,  E*.  the  simplest  unification  problem  is  called  the 
variable-only  case.  The  assumption  is  that  there  is  one  set  of  axioms,  E,  and  terms  to  be 
unified  contain  only  function  symbols  appearing  in  E  and  variables. 

2.  Unification  in  the  case  with  free  symbols  is,  again,  unification  with  a  single  set  of  axioms, 

E,  but  the  terms  may  contain  free  function  symbols,  i.e.,  unconstrained  symbols  of  any 
arity,  in  addition  to  function  symbols  in  E  and  variables. 

3.  The  multiple  instance  case  unification  problem  allows  more  than  one  instance  of  a  set  of 
axioms,  for  example,  the  AC  theory  for  +  and  the  AC  theory  for  *.  In  this  case  terms  still 
contain  only  function  symbols  from  the  axioms  and  variables,  but  a  single  term  may 
contain  more  than  one  operator  with  the  given  properties,  i.e.,  both  +  and  •. 

4.  The  unification  problem  for  combined  theories  is  to  take  sets  of  unrelated  axioms,  for 
example,  the  AC  theory  for  +  and  the  "commutativity  of  insert"  theory  for  sets,  and  allow 
unification  of  terms  containing  function  symbols  from  both  of  these  theories.' 

It  is  this  last  problem,  the  problem  of  combining  equational  unification  algorithms,  which  is  studied 
and  partially  solved  here.  This  thesis  provides  an  algorithm  for  combining  equational  unification 
algorithms  for  a  restricted  class  of  equational  theories  and  characterizes  some  of  the  theories  for 
which  the  combined  problem  is  not  solved.  This  problem  was  suggested  as  an  open  problem  in 
[Siekmann  84]  and  [Shostak  84], 


1 .5  Related  Work 


This  section  discusses  some  of  the  work  that  has  be  done  on  developing  unification  algorithms. 
Section  1.5  1  give  a  short  survey  of  algorithms  that  have  been  designed  to  solve  the  unification 
problem  m  one  particular  theory  Section  1.5.2  describes  a  class  of  unification  procedures  that  can 
be  automatically  generated  from  an  axiomatization  of  the  theory,  and  Section  1.5.3  looks  at  some 
work  related  to  the  general  problem  of  designing  algorithms  for  combinations  of  theories. 

1.5.1  Single  Theory  Algorithms 

A  number  of  unification  algorithms  have  been  developed  for  particular  equational  theories,  and  a 
great  deal  of  effort  has  been  devoted  to  improving  and  bounding  the  running  time  of  these  algorithms. 
It  is  interesting  to  note  that  theoretical  measures  of  complexity  for  these  algorithms  often  do  not 
reflect  their  relative  running  times  in  practice.  This  is  probably  because  of  the  small  size  of  terms  in 
the  average  case,  although  little  work  has  been  done  in  trying  to  formally  characterize  the  average 
case  for  a  unification  problem  or  in  measuring  the  performance  of  algorithms  based  on  an  average 
case 

For  the  empty  theory,  the  first  algorithm  was  described  in  [Robinson  71]  and  is  exponential  in  the 
size  of  the  input.  It  has  been  modified  by  representing  terms  as  directed  acyclic  graphs  rather  than 
trees  [Corbin  83]  to  give  an  n?  algorithm.  The  algorithm  of  [Paterson  78]  runs  in  linear  time  and  those 
of  [Martelli  82]  and  [Baxter  73]  run  in  nearly  linear  time.  [Martelli  82],  while  theoretically  slower  than 
the  linear  algorithms,  runs  faster  on  some  typical  examples.  Also,  the  modified  algorithm  of  [Corbin 
83]  is  fast  m  practice  and  has  the  additional  advantage  that  the  structure  of  the  algorithm  is  simple 
and  intuitive;  one  disadvantage  of  the  [Corbin  83]  approach  is  that  it  depends  heavily  on  a  data 
structure  for  terms  that  may  or  may  not  be  appropriate  within  an  application. 

Some  of  the  currently  known  complete  E-unification  algorithms  are  for  commutative  operators 
[Siekmann  79].  AC  operators  [Stickel  81,  Livesey  76]  (with  termination  in  the  multiple  instance  case 
proved  in  [Fages  84]),  signed  trees  [Kirchner  81],  one  sided  distributivity  [Arnborg  85],  and  transitivity 
[Kirchner  85].  There  are  variations  on  the  AC  algorithm  [Livesey  76,  Fages  84]  for  AC  with  idem- 
potence.  and  AC  with  a  unit  element,  AC  with  both  idempotence  and  a  unit.  An  algorithm  for  the 
variable-only  case  of  free  abelian  groups  is  given  in  [Lankford  84],  and  more  generally  for  finitely 
presented  abelian  groups  in  [Kandri-Rody  85].  [Kandri-Rody  85]  also  gives  unification  algorithms  for 
finitely  presented  boolean  rings  and  finitely  presented  boolean  rings  of  polynomials  (i.e.,  with 
idempotence). 


The  decision  problem  for  unification  in  the  associative  theory,  also  known  as  siring  unification, 
has  been  shown  decidable  [Makanin  77],  The  associative  theoa;  has,  in  general,  an  infinite  set  of 
most  general  unifiers,  so  a  terminating  algorithm  cannot  exist.  However,  a  complete  procedure  for 
enumerating  unifiers  of  an  associative  operator  is  described  in  [Plotkin  72].  This  procedure  is  given 
for  one  associative  operator  with  unconstrained  symbols  of  any  arity.  Not  all  equational  theories  have 
decidable  unification  problems.  [Szabo  78]  shows  that  unification  in  the  associative  distributive  theory 
is  undecidable  and  [Arnborg  85]  shows  that  combining  associativity  with  one  sided  distributivity  and  a 
unit  element  gives  a  theory  with  an  undecidable  unification  problem. 

For  theories  in  which  a  unification  algorithm  is  known,  the  execution  times  of  many  have  been 
disappointingly  high.  These  observations  are  explained  by  some  recent  results  classifying  the  com¬ 
plexity  of  different  unification  problems.  The  unification  problem  in  the  commutative  theory  is  known 
to  be  NP-complete  [Garey  79],  and  in  the  AC  theory  to  be  NP-hard  [Kapur  85,  Chandra  84].  Unifica¬ 
tion  in  the  theory  of  right  and  left  identity  is  NP-hard  while  the  theory  of  one-sided  distributivity  can  be 
done  in  polynomial  time  [Arnborg  85].  A  restricted  case  of  unification  is  the  matching  problem,  in 
which  a  substitution  is  applied  to  one  term  to  make  it  equal  to  another.  [Benanav  85]  shows  that  even 
this  simpler  problem  has  an  NP-complete  decision  problem  in  both  the  AC  and  commutative  theories. 

Much  of  this  past  work  in  equational  unification  has  made  use  of  simplifying  assumptions  on  the 
structure  of  terms,  i.e. ,  algorithms  are  usually  developed  to  handle  terms  whose  operators  all  belong 
to  a  single  set  of  axioms.  In  most  cases,  the  above  unification  algorithms  were  designed  for  the 
variable-only  case,  possibly  with  constants.  Contrary  to  many  claims,  we  show  in  Section  2.4  that 
extensions  to  the  case  with  free  symbols  is  often  non  trivial. 

Under  the  current  approach,  every  time  a  new  axiom  is  added  to  the  theory,  a  new  unification 
algorithm  must  be  found  and  implemented  for  the  new  set  of  axioms.  The  work  of  [Pages  83b,  Fages 
84j  takes  steps  toward  remedying  this  situation  by  extending  the  unification  algorithm  for  AC  to 
handle  terms  containing  a  mix  of  theories  including  empty,  commutative,  and  AC  theories.  However, 
his  approach  is  still  ad  hoc  rather  than  generalized.  Adding  another  theory  to  his  algorithm,  for 
example  the  theory  of  left  distributivity,  would  require  modification  of  his  algorithm.  The  modified 
algorithm  would  have  to  consider  terms  containing  all  the  possible  combinations  of  operators.  In 
contrast,  this  thesis  describes  a  method  for  automatically  combining  theories. 

Figure  1-4  summarizes  some  previous  results  on  developing  equational  unification  algorithms, 
with  careful  attention  paid  to  whether  an  algorithm  is  a  solution  to  the  variable-only  case,  the  case 
with  free  symbols,  or  the  multiple  instance  case.  Algorithms  that  permit  constants  as  well  as  variables 


are  recorded  in  the  variable-only  case  column;  the  constants  are  added  in  some  theories  to  keep  the 
decision  problem  from  becoming  trivial;  they  play  an  uninteresting  role  in  the  problem  of  generating  a 
complete  set  of  unifiers,  except  to  eliminate  those  unifiers  that  equate  two  constants.  Algorithms  from 
[Hullot  80]  and  [Jouannaud  83]  are  based  on  narrowing,  a  process  for  unification  described  in  the 
following  section. 
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no  axioms 

x  +(t  +  :)  =  (x  +  v)+  x 
v  +  v  =  y +  x 
x  +  v  =  x 

x  *0  +-)  =  (x  m)  +  (.vt) 

(-»  +.» )*  f  =  (v  +.1  )*- 
Dj  and  Df 
x  +  0  =  0  +  v  =  x 

/.  ( x  *  i  )  =  /;(*)  +  /i(_i  ) 

-(-  x )  =  x 

-(<  \t  )  =  (->)•(--»  ) 

-(-x)=  r 

-(x  +  .i  )  =  (-v)  +  (-t) 

(x  +.i)  +  (-;)  =  at 
(- ax )  +  (  x  +>')=>' 
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JVU ,y),  i)-J'(f(x,z),y) 
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Figure  1*3:  Some  Common  Equational  Theories 
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Figure  1-4:  Known  Unification  Algorithms 


1 .5.2  Narrowing 


While  most  of  the  existing  unification  work  in  unification  has  required  human  invention  of  each 
algorithm,  the  unification  procedures  based  on  narrowing  [Slagle  74]  are  automatically  generated. 
For  equational  theories  representable  by  a  convergent  term  rewriting  system  there  is  method  for 
performing  unification  in  the  theory  of  the  rewriting  system  [Fay  79].  [Hullot  80]  gives  sufficient  con¬ 
ditions  for  termination  of  the  narrowing  process,  along  with  some  improvements,  and  [Jouannaud  83] 
generalizes  this  work  to  equational  term  rewriting  systems.  [Rety  85]  further  improves  on  the  ef¬ 
ficiency  of  the  narrowing  process  and  detects  cycles  in  the  unifiers. 

The  approach  described  in  this  thesis  serves  a  quite  different  purpose  than  the  work  on  unifica¬ 
tion  through  narrowing;  the  narrowing  procedure  does  not  assume  the  existence  of  equational 
unification  algorithms,  but  generates  a  procedure  based  on  its  axioms,  whereas  our  approach  might 
use  a  unification  algorithm  produced  by  narrowing  as  one  of  the  basic  pieces  to  be  combined.  For 
example,  the  work  in  narrowing  has  lead  to  unification  algorithms  for  theories  described  by  a  conver¬ 
gent  term  rewriting  system  in  which  all  right-hand  sides  are  either  a  single  variable  or  ground  term. 
These  theories  include  idempotence,  identity,  and  quasi-groups.  The  algorithm  described  in  this 
thesis  applies  to  narrowing  algorithms  for  theories  in  which  right  and  left  sides  of  all  axioms  are 
ground  terms,  e  g.,  1*1=0.  In  other  words,  given  a  theory  presented  by  ground  equations,  we  can 
automatically  generate  a  unification  algorithm  through  narrowing;  this  algorithm  can  be  combined 
using  our  approach  with  other  unification  algorithms. 

The  narrowing  process  is  interesting  from  a  theoretical  standpoint,  and  gives  quick  positive 
answers  to  the  question  of  existence  of  an  E-unification  algorithm  for  theories  presented  by  ground 
equations  or  unconfined  equations.  For  the  cases  in  which  narrowing  gives  a  unification  algorithm,  it 
will  also  solve  the  problem  with  free  symbols  or  multiple  instances,  but  can  solve  the  combining 
problem  for  only  very  limited  combinations  of  theories.  Moreover,  even  when  the  process  terminates, 
it  is  too  inefficient  to  be  practical  as  pail  of  a  larger  system  such  as  a  term  rewriting  completion 
procedure  or  resolution  system. 

1 .5.3  Combining  Theories 

Although  the  problem  of  combining  unification  algorithms  is  a  known  open  problem  [Shostak 
84,  Siekmann  84],  the  problem  of  combining  decision  procedures  for  first  order  theories  has  been 
studied.  Nelson  and  Oppen  provide  a  procedure  for  deciding  whether  a  formula  is  a  theorem  in 
combination  of  first  order  predicate  calculus  theories  [Nelson  79].  Their  algorithm  uses  a  set  of 
decision  procedures  for  the  theories  being  combined,  much  in  the  same  way  we  will  use  unification 


algorithms  ic  the  theories  being  combined  [Shosta*  84}  improves  on  the  algorithm  of  [Nelson  79]  by 
localizing  the  information  shared  between  algorithms.  This  yields  an  improvement  in  the  algorithm's 
efficiency  and  extendibility. 

The  similarity  between  the  structure  of  our  unification  algorithm  and  the  decision  procedure  of 
[Nelson  79]  is  apparent  when  unification  is  considered  in  the  Martelli  and  Montanari  style  of 
propagating  equalities  [Martelli  82],  [Kirchner  84a]  gives  an  algorithm  based  on  this  style  for  unifica¬ 
tion  in  the  decomposable  theories,  a  class  of  theories  in  which  a  natural  decomposition  process 
occurs  during  unification.  For  example,  if  /  is  a  symbol  that  does  not  appear  at  the  head  of  either 

side  of  any  equations  in  E,  then  the  problem  of  E-unifying  terms  of  the  form  /(s  . sn)  and  /(fr-.-,fn), 

modulo  E,  is  proved  equivalent  to  unifying  all  pairs  s|t  fr  1  <  i  <  n. 

[Kirchner  85]  has  independently  developed  an  algorithm  for  combining  unification  algorithms.  He 
generalizes  his  earlier  work  hy  defining  the  notion  of  a  decomposition  process  for  theories  in  which 
the  natural  decomposition  does  not  occur.  For  example,  in  the  the  commutative  and  AC  theories 
there  is  a  finite  set  of  possible  decompositions  related  to  the  set  of  possible  unifiers. 

The  algorithm  of  [Kirchner  85]  is  based  on  two  main  phases:  simplification  of  the  unificands,  and 
formation  of  substitutions.  His  algorithm  is  correct  for  a  slightly  smaller  class  of  theories  than  the 
algorithm  we  will  be  defining  In  particular,  if  a  theory  contains  a  ground  equation  in  an  irredundant 
aresentation,  then  Kirchner's  approach  does  not  work.  We  will  discuss  the  exact  details  of  his 
restrictions  on  equational  theories  in  Section  2.1.3  3  after  our  own  restrictions  are  presented. 
[Kirchner  85]  reports  that  his  implementation  works  faster  than  ours  on  some  typical  examples  using 
the  AC  theory.  This  may  be  because  of  earlier  discovery  of  clashes  using  his  approach,  since  he 
detects  all  clashes  during  the  simplification  phase  before  beginning  the  more  expensive  phase  which 
involves  cyclicity  testing. 


Chapter  Two 

A  Generalized  Unification  Algorithm 


This  chapter  presents  our  approach  to  generalized  unification.  The  presentation  is  bottom-up.  In 
Section  2. 1  we  state  some  assumptions  on  the  equational  theories,  Section  2.2  presents  a  description 
of  the  algorithm,  and  Section  2.3  gives  a  detailed  example. 


2.1  A  Generalized  Approach 

In  equational  unification,  a  unification  algorithm  must  be  discovered  and  implemented  for  each 
equational  theory  of  interest,  and  with  some  notable  exceptions,  (see  Section  1.5)  this  process  is  not 
automatic.  As  we  will  see,  the  problem  of  combining  algorithms  is  also  non  trivial.  Our  approach  is  to 
break  a  combined  unification  problem  into  pieces  that  we  know  how  to  solve  with  the  sub-theory 
algorithms,  and  then  to  combine  the  answers  for  each  these  sub-problems  to  get  a  solution  to  the 
whole  problem. 

Our  algorithm  is  recursive;  a  top-level  procedure  performs  the  steps  in  unification  that  are  com¬ 
mon  to  all  equational  theories  and  then  invokes  an  appropriate  equational  unification  algorithms  for 
sub- problems  particular  to  one  theory. 

2.1.1  Partitioning  Equational  Theories 

Our  first  underlying  assumption  is  that  the  sets  of  operators  handled  by  each  unification  proce¬ 
dure  are  disjoint.  Consider  the  following  example:  we  are  given  a  unification  algorithm  for  AC- 
unification  and  an  algorithm  for  unification  with  an  idempotent  constant  (IK-unification).  If  the  + 
operator  is  known  to  be  AC  and  have  an  idempotent  constant,  a,  (i.e.,  a  +  a  =  a),  our  technique  will 
not  automatically  generate  an  algorithm  for  AC-IK-unification  because  the  AC  and  IK  axioms  interact 
through  the  shared  symbol  + .  On  the  other  hand,  if  +  were  AC  and  ♦  were  IK  with  a ,  and  there  were 
no  other  axioms  in  E,  our  approach  would  generate  an  algorithm  for  this  theory. 

We  will  treat  each  unification  algorithm  for  an  equational  theory  as  a  "black-box,"  invoking  it  with 
certain  inputs,  but  never  examining  the  operation  within  the  box.  The  above  problem  with  +  being 
both  AC  and  IK  can  be  eliminated  by  considering  only  sets  of  axioms  with  disjoint  operator  sets. 
Formally  we  define  a  partitioning  on  the  axioms  presenting  E*. 
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Definition  Let  w  =  {E,  E2 .  En},  where  each  E  is  a  set  of  equations  it  is  a  partitioned 

presentation  of  an  equational  theory  E*  if  and  only  if: 

1.  JfE^rt  JfE)  =  0,  V  i*j  <  n 

2  U  E(  is  a  presentation  of  E*,  and 

3.  0€w. 

Each  of  the  E^s  presents  a  theory,  E\,  called  a  sub-theory  of  E*.  The  empty  set  of  equations  in  (3) 
represents  the  empty  equational  theory,  which  is  a  sub-theory  of  any  theory.  Semantically,  the  union 
of  a  set  of  equational  theories  corresponds  to  taking  the  intersection  of  their  models,  Jl>(E)  = 
Q  JME). 

The  partition,  w,  naturally  defines  an  equivalence  relation  on  function  symbols.  Let  f.  =/„  if  and 

1  V  i 

only  if  either: 

1 .  There  exists  E(  €  it  such  that./',  €  7(E()  and /2  £  eJ(Et). 

2.  Or,  for  all  E(  €  ir,/,  €  ‘JfE^  and/2  £  CJ(E). 

The  equivalence  class  of  symbols  containing  /  will  be  denoted  (/),  e.g..  if  one  of  the  sub-theories  is 
ACZ  with  +  as  the  AC  operator  and  0  as  the  identity  constant,  then  [  +  ]  =  { +  ,  0}.  Function  symbols 
that  do  not  appear  in  any  of  the  E 's  are  called  uninterpreted  and  all  belong  to  one  equivalence  class. 

This  equivalence  relation  will  provide  a  convenient  way  of  naming  unification  algorithms  of  sub¬ 
theories.  It  is  not  quite  correct  to  refer  to  an  operator  as  being  'in’  a  particular  sub-theory,  since  each 
sub-theory  has  the  same  fixed  signature,  F.  We  therefore  fix  the  partitioned  presentation  w  for  E*  and 
refer  to  the  set  of  function  symbols  appearing  in  Ej  as  the  constrained  function  symbols  for  E*.  For 
technical  reasons,  we  will  let  the  set  of  uninterpreted  symbols  be  the  constrained  symbols  for  the 
empty  theory.  The  unification  algorithm  for  the  theory  constraining  F  will  be  denoted  E F-unify  or 
Etfunify,  if/€F. 

An  example  should  help  clarify  our  definitions.  Let  E*  be  presented  by  the  axioms  in  Figure 
2-1  and  let  the  signature,  F,  be  the  set  {  +  ,*,  a,  b,  •,/,  g)  with  appropriate  arities.  it  -  {Er  E2,  E3, 
E4}  is  a  partitioned  presentation  of  E.  it  is  said  to  be  a  minimal  partition  because  none  of  the  elements 
of  it  can  be  divided  further  without  losing  the  disjointness  property  on  function  symbols.  In  general 
we  will  use  minimal  partitions  because  this  results  in  the  simplest  sub-theories,  although  minimality  of 
partitions  is  not  required. 

There  is  a  final  technicality  to  clarify  before  beginning  the  discussion  of  our  algorithm.  Although 


<EvE2’E3.E4} 

E,:  x  +  y  =  y  +  x 

(x  +  y)  +  z  *  x  +  0’  +  x) 
E2:  x  •  y  =  y  •  x 
a  *  a  -  a 

(x  •  y)  •  z  =  x  *  tv  •  r) 

E3:  x  •  y  =  y  •  x 

(x  •  y)  •  z  -  x  •  O'  •  jt) 


Classes  of  F:  (F^  F2,  F3,  F4) 
F,  *  5(E,)  =  {  +  } 
F9-5(EJ-{«,fl} 


F,  =  S(EJ  =  {•} 


F4  *{!>,/,  g) 


Figure  2-1 :  A  Partitioned  Presentation 


we  speak  of  an  E-unification  algorithm  for  a  particular  equational  theory,  each  algorithm  is  really  for 
an  isomorphism  class  of  equational  theories.  For  example,  if  both  +  and  •  are  AC,  as  in  Figure  2-1, 
we  can  use  the  same  algorithm  for  unifying  a  pair  of  terms  containing  +  or  a  pair  of  terms  containing 
v  The  two  equational  theories,  +  AC  and  *  AC,  are  not  equal  theories,  but  the  isomorphism  is  so 
natural  that  we  would  normally  consider  them  to  be  the  same.  A  difficulty  arises  when  the  two 
theories  are  combined,  i.e.,  when  terms  to  be  unified  contain  more  than  one  operator  with  the  same 
equational  properties.  To  resolve  this  issue,  each  E-unification  algorithm  is  parameterized  over  the 
set  of  names  of  its  constrained  operators.  In  this  example,  AC-unification  for  +  and  AC-unification 
for  •  are  both  instances  of  the  same  E-unification  algorithm.  For  the  purpose  of  this  discussion,  we 
will  assume  a  different  unification  procedure  exists  for  each  instance  of  a  theory,  although  in  the 
implementation  we  do  not  duplicate  the  actual  code. 


2.1 .2  Some  Basic  Functions 

Our  algorithm  begins  by  transforming  the  input  terms  into  simpler  terms  containing  only  operators 
from  a  subset  of  the  axioms,  a  subset  for  which  there  is  a  known  E-unification  algorithm.  The 
information  lost  in  the  transformation  is  saved  in  the  form  of  a  substitution.  This  substitution  is 
combined,  through  E-unification  of  substitutions,  with  each  sub-theory  unifier  of  the  transformed 
terms.  Section  2.1 .2.1  describes  this  transformation  process  on  terms  and  section  2.1 .2.2  describes  a 
procedure  for  unifying  substitutions. 


2. 1.2.1  Homogeneous  Terms 

A  term,  t,  is  called  homogeneous  with  respect  to  a  set  of  function  symbols,  F,  if  and  only  if 
cJ(t)  C  F.  We  define  a  homogenizing  function,  Homog,  to  convert  an  inhomogeneous  term  (i.e.,  a  term 
containing  operators  that  are  not  in  F)  into  a  homogeneous  term.  The  basic  operation  of  Homog  is  to 
replace  all  maximal  subterms  whose  top  function  symbol  is  outside  F  with  a  new  variable. 

Definition.  Let  F  be  a  set  of  function  symbols  and  t  be  a  term,  then  Homog(t,  F)  is  defined  as: 

1.  If  f  is  a  variable,  then  Homog(t,  F)  =  /. 

2.  If  f  =  f{ty...,tn)  and/ €  F,  then  Homog(t,  F)  =  f(Homog(ty  F) . Homog(tn,F)). 

3.  If  t  =  /  )  and  ft  F,  then  Homog(t,  F)  =  v,  where  v  is  a  new  variable. 

As  defined,  Homog(t,  F)  is  not  a  function  but  is  unique  for  t  up  to  names  of  the  new  variables. 
Technically,  we  should  be  more  precise  about  the  new  variables  that  are  used,  for  example,  in  case 
(2)  we  assume  any  new  variables  in  HomogitJ  are  disjoint  from  both  the  old  variables  in  t.  and  new 
ones  in  Homog{t.),  for  i*j.  To  assure  our  algorithm  is  protective,  these  new  variables  must  also  be 
disjoint  from  the  set  of  protected  variables.  We  formalize  the  naming  of  new  variables  in  Appendix  A. 

Taking  F  =  {a  ,  •},  we  have  the  following  values  for  Homog: 

Homog(x  *(a  +y),F)  *  x  •v1 
Homog(x  >(a  *b  ),  F)  =  x  >(a  «v2) 

Homog(x  +>> ,  F)  =  Vg. 

In  general,  homogenization  of  a  term  is  done  with  respect  to  some  equivalence  class  of  F  as  defined 
by  ir,  usually  the  equivalence  class  of  the  root  symbol.  We  use  the  notation  f  to  denote 
Homog(t,[t.head]). 

In  forming  a  homogeneous  term,  part  of  the  structure  of  the  original  term  is  lost.  To  take  a 
homogeneous  term  back  to  the  term  from  which  it  was  formed,  we  find  a  preserving  substitution. 
Notice  that  t  is  an  instance  of  f  and  we  can  therefore  find  the  match  of  t  for  f,  in  this  case  called  a 
preserving  substitution,  by  Preserve (t,  t).  Preserve (t,  T)  maps  each  new  variable  in  f  to  the  term  it 
replaced  in  t.  We  distinguish  the  preserving  substitutions  from  normal  matching  because  the  prevers¬ 
ing  substitution  is  unique  for  t  within  variable  names  in  its  domain. 

Definition.  Let  t  be  a  term  and  f  be  its  homogenous  form.  The  preserving  substitution  for  t  and  f  is  a 
substitution  y  such  the*: 


9(y)CtW&Yf=f. 


2.1.2  2  Unification  of  Substitutions 


This  section  describes  our  method  for  combining  sub  problem  unifiers,  which  involves  unification 
of  substitutions. 

Definition  Given  a  set  of  equations.  E,  a  substitution,  o,  is  said  to  E  unily  two  substitutions,  qp.,  and 
9  2  if  and  only  if: 

<*  °<P, 

We  need  an  effective  procedure,  call  it  map  unify,  for  finding  unifiers  of  two  substitutions,  <p1  and 
<jr 2.  In  looking  for  a  unifier  of  two  substitutions,  as  in  testing  for  equality  of  substitutions,  we  restrict 
our  selves  to  the  domain  of  the  variables  V  =  t  )UEJ(«jp2).  A  corresponding  pair  of  terms  is  defined 

to  be  a  pair,  <ty  f2>.  where  r,  =  9  v  and  f2  =  <r2v  *or  some  '  €  v-  If  we  can  unify  each  corresponding 
pair  of  terms  in  substitutions  sequentially,  accumulating  the  unifiers,  and  applying  the  results  to 
remaining  pairs,  the  end  result  will  be  a  set  of  unifiers  of  the  substitutions.  The  routine  in  Figure 
2  2  performs  thu  desired  function,  map-unity  assumes  the  existence  of  our  main  algorithm,  CR-unify, 
because  the  two  procedures,  map-unify  and  CR-unify.  are  mutually  recursive.  (For  technical  reasons 
that  will  be  made  clear  in  the  termination  proof,  the  variables  in  3(<f2)  are  unified  after  the  variables  in 
3(9,).) 


map  unify  =  proc  (y  ,  <p  :subst)  r et u r n S ( X :subst_set) 

V=  3<?,>-3(*2) 

V2:=9(<p2) 
i :  =  0 

for  j  =  1  to  2  do 
for  v  in  V.  do 
i :=  i  ♦  1 

Zi :=  {wi°ai-i  I  °m€2m  &  u^CR-unify(v.  yrp^,  oM«p2v)} 

end 

return(Z.) 
end  map-unify 

Figure  2-2:  Procedure  map-unify  for  Unification  of  Substitutions 


1  i  .  ip-iijiii.'  u. j  ■  ..  ,  i  .■  ■  7  ■  .»  i*.'  j. x ti 


2.1.3  Restrictions 

The  CR-unily  procedure  presented  in  this  thesis  is  correct  for  only  a  restricted  class  of  equational 
theories.  Two  syntactic  restrictions  on  the  axioms  in  v,  confinement  and  regularity,  are  sufficient  to 
show  correctness. 

2. 1.3.1  Confined  Theories 

The  first  restriction  will  be  to  eliminate  those  equational  theories  in  which  two  E-equal  terms  have 
head  symbols  that  are  constrained  by  different  sub- theories.  Because  we  assume  the  axiom  sets  are 
disjoint,  this  case  can  occur  only  if  there  is  a  equation  in  the  theory  of  the  form  t  =  s,  where  either  t  or 
s  is  a  variable  and  the  other  term  is  a  non-variable.  If  this  kind  of  equation  is  in  one  sub-theory,  then 
there  will  also  be  instances  of  this  equation  in  the  entire  theory,  where  the  heads  are  not  equivalent 
modulo  v.  Equations  of  this  form,  between  a  variable  and  a  non- variable,  will  be  referred  to  as 
non  confining ,  because  they  provide  a  means  of  deducing  equations  whose  roots  are  not  confined  to 
the  same  equivalence  class  of  F.  A  set  of  equations  containing  no  non  confining  equations  is  called 
confined ;  this  terminology  also  applies  to  theories  since  they  are  closed  sets  of  equations. 

An  example  of  an  unconfined  theory  is  the  theory  of  idempotence.  Let  E  be  the  theory  presented 
byw  =  {{*  •  x  =  x),  0},  and  let/ be  a  function  symbol  in  F.  The  equation /(x)  • /(a  )  |/(x)isinE, 
even  though  •  and  f  are  in  different  equivalence  classes.  The  problem  caused  by  having  roots  in 
different  equivalence  classes  will  be  apparent  in  the  description  of  CR-unify,  where  we  begin  the 
unification  process  by  unifying  in  the  sub  theory  of  the  roots.  Lemma  1  will  state  that  this  restriction 
can  be  made  on  the  axioms,  rather  than  the  theory,  and  Theorem  I  will  give  the  desired  property  on 
head  symbols  of  equations.  We  postpone  the  presentation  of  these  proofs  until  Chapter  3. 

2. 1 .3.2  Regular  Theories 

The  second  restriction  is  on  the  sets  of  variables  in  equations  of  the  theory.  The  problem  comes 
from  variables  that  occur  on  one  side  of  the  equation,  but  not  on  the  other.  An  equation,  r  =  s,  is 
regular  if  and  only  if  T(t)  =  T\s).  We  extend  this  to  sets  of  equations  by:  E  is  regular  if  and  only  if  all 
equations  in  E  are  regular.  As  in  the  case  of  confined  theories,  we  can  restrict  ourselves  to  reqular 
theories  by  restricting  the  set  of  axioms.  Lemma  3  states  the  equivalence  of  these  two  properties  and 
Theorem  II  gives  sufficient  conditions  under  which  a  variable  and  a  term  containing  that  variable  are 
not  unifiable. 

We  have  eliminated  some  interesting  theories,  such  as  idempotence,  identity  and  minus,  however, 
there  are  still  many  interesting  theories  that  are  both  confined  and  regular.  The  distributive  theory, 
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the  homomorphism  theory,  and  any  theories  presented  by  ground  equations  are  confined  and 
regular.  In  addition,  any  theory  in  which  the  right  side  of  each  equation  is  identical  to  the  left  side 
within  a  permutation  of  variables,  i.e.,  a  permutative  theory,  is  confined  and  regular. 

Recall  from  Section  1.4.1  that  one  of  our  reasons  for  studying  E-unification  was  to  avoid  termina¬ 
tion  problems  in  systems  that  use  equations  as  oriented  rules.  A  permutative  equations  oriented  in 
such  a  manner  will  always  a  program  to  loop,  since  they  can  be  applied  repeatedly.  An  unconfined 
equation,  if  oriented  into  a  rule,  will  not  lead  to  termination  problems  since  the  variable  side  of  the 
equation  is  simpler  than  the  non-variable  side.  Consequently,  the  equations  that  most  often  lead  to 
termination  problems  in  an  application  can  be  handled  by  our  unification  algorithm,  whereas  equa¬ 
tions  that  cannot  be  handled  by  our  unification  algorithm  will  often  not  cause  termination  problems  in 
an  application. 

2. 1 .3.3  Strict  and  Strongly  Complete  Theories 

In  this  section  we  digress  in  order  to  give  a  careful  distinction  between  ou  .estriction  to  confined 
regular  theories  and  the  restrictions  of  [Kirchner  85].  As  discussed  in  Section  1.5.3,  Kirchner’s  algo¬ 
rithm  is  based  on  two  main  phases:  simplification  of  unificands  and  formation  of  substitutions.  His 
algorithm  for  simplification  is  correct  and  terminating  for  the  confined  theories  (he  calls  them 
non-potent  theories.)  The  simplification  algorithm  produces  a  simplified  system  of  unificands  and  is 
complete  in  a  sense  defined  in  that  work. 

The  second  phase,  formation  of  substitutions,  involves  forming  substitutions  from  simple  pairs  of 
unificands,  e  g.,  one  variable  and  one  non-variable.  In  general,  discovery  of  all  unifiers  may  require 
another  phase  of  simplification  after  some  substitutions  have  been  formed,  and  this  makes  it  difficult 
to  guarantee  termination.  To  avoid  having  to  alternate  between  simplification  and  substitution  for¬ 
mation,  Kirchner  places  further  restrictions  on  the  equational  theories  so  that  a  single  phase  of 
simplification  followed  by  a  single  phase  of  substitution  formation  gives  a  complete  set  of  unifiers. 

The  correctness  of  his  unification  algorithm  depends  on  the  theories  being  strongly  complete  and 
strict  as  well  as  confined. 

Definition.  A  theory  E*  is  said  to  be  strongly  complete  if  and  only  if  for  all  variables,  x,  and  terms,  t, 
there  exists  2,  a  complete  set  of  unifiers  of  x  and  t  such  that: 

Vo€Z,9(o)  -  {x}. 

As  an  example  of  a  theory  that  is  not  strongly  complete,  consider  the  theory  presented  by  a  +  b  =  a . 
This  theory  is  not  strongly  complete  because  the  terms  x  *y  and  x  are  unifiable  by  the  substitution 


{<«-.;  r-'  }.  but  not  by  any  substitution  having  only  a  initsdomain.  All  theories  presented  by  ground 
equations  that  have  at  least  one  operator  of  arity  two  or  higher  in  the  presentation,  will  fail  the  strong 
completeness  test. 

The  notion  of  strictness  of  an  equational  theory  depends  on  the  following  ordering  on  pairs  of 
terms. 

Definition.  <t,  s  >  if  and  only  if: 

1. 1  is  variable,  t'  is  not  a  variable  &  or 

2.  t  is  variable,  s'  is  not  a  variable  &  tCffs'),  or 

3.  1  or  2  is  true  with  s  for  f. 

Consider  a  set  of  unificands.  i.e ,  pairs  of  terms  to  be  unified,  for  which  we  want  to  find  a  single 
substitution  that  unifies  all  pairs  in  the  set.  If  such  a  unifier  exists,  then  we  take  the  transitive  closure 
of  -'ir  on  the  set  and  determine  whether  or  not  the  transitive  closure  is  strict.  (A  strict  partial  order  is 
one  which  is  irreflexive  and  asymmetric.)  A  theory  is  strict  only  if  the  transitive  closure  of 
denoted  -<  jA  is  strict  More  precisely: 

Definition  A  theory  E*  is  strict  if  and  only  if  for  all  sets  of  pairs  of  terms,  P: 

3o  such  that  (V<f,  s>€P,  at  =as) 

=>  -<<£*  is  strict  on  P. E 

The  simplest  examples  of  theories  that  are  not  strict  are  those  in  which  a  variable  is  unifiable  with  a 
term  containing  that  variable.  For  example,  in  the  theory  presented  by  g  (g  (a  ))  =  g  ( » ).  ,i ;  0  )  and  >'  are 
unifiable.  but  <«(*),  x>  is  less  than  itself  by  -<<£,  so  -<<£  is  not  strict  on  P-{<gO),  \  >}  Any  theory 
with  an  equation  in  which  one  side  is  a  subterm  of  the  other  will  be  non-strict. 

Kirchner’s  restrictions  on  the  equational  theories  are  less  general  than  ours  because  both  al¬ 
gorithms  require  confinement  and  because  strictness  implies  regularity.  For  evidence  of  the  latter,  we 
construct  a  pair  of  unifiable  terms  that  cause  a  trivial  cycle  in  the  -<<£*  ordering  in  any  non-regular 
theory.  Given  any  non  regular  equation  in  E*,  it  will  be  of  the  form  t  =s,  where  3 »’ ,  r€f  &  v£s.  The 
pair  of  terms  v  and  t  are  unifiable  in  E*  by  the  substitution  {v*-s},  but  -<<£*  is  not  strict  on  {<  v ,  f>} 
since  v€f. 

Note  that  the  restrictions  of  strong  completeness  and  strictness  are  conditions  on  the  infinite  set 
of  equations  in  the  theory,  whereas  our  restrictions  are  syntactic  checks  on  any  presentation  of  the 
theory.  Moreover,  strictness  implies  regularity  but  not  the  reverse,  so  his  algorithm  is  slightly  less 
general  than  ours.  Examples  of  theories  handled  by  our  algorithm  but  not  by  Kirchner’s  include  most 
theories  presented  by  ground  equations,  e  g.,  a  +  b  =  a,  and  the  confined  regular  theories  containing 
equations  with  one  side  a  subterm  of  the  other,  e  g.,  £(&(*))  =  g(x). 


33 


2.2  The  Algorithm 

The  ma  n  a'gonthm.  an  algorithm  for  generalized  equational  unification  of  terms,  is  presented 
below.  Tne  basic  assumptions  are  summarized  here  E*  is  a  strictly  consistent  equational  theory, 

with  a  fixed  partitioned  presentation.  ■»  =  {E1 En}  For  each  E*.  there  is  a  known  unification 

algorithm  called  E  -unify,  which  returns  a  complete  set  of  unifiers,  given  any  two  terms  that  are 
homogeneous  in  the  constrained  symbols  of  E*.  We  also  assume  that  E'  is  confined  and  regular, 
although  the  reason  for  these  last  two  requirements  will  not  become  apparent  until  the  proof  of 
correctness  in  Chapter  3. 

The  CR  unify  procedure  is  ^  ven  in  Figure  2-3.  If  >  and  s  are  variables  or  if  /  is  a  variable  not 
occurring  in  s  then  the  terms  are  unifiable  by  the  substitution  {f*-s}.  If  t  and  s  are  both  non  variables 
with  root  symbols  from  different  equivalence  classes  of  F.  then  any  substitution  instance  of  f  and  s  will 
also  have  root  symbols  with  this  property,  so  t  and  s  are  not  unifiable  (see  Theorem  I). 

If  both  t  and  s  are  non  variables  with  root  symbols  in  the  same  equivalence  class,  then  we  form 
homogeneous  terms  and  determine  the  preserving  substitutions.  Preserve (t,  f)  and  Preserves,  s). 
The  union  of  these  substitutions  is  well  defined  because  the  domain  of  each  contains  only  new 
variables  from  f  and  s,  and  these  two  variable  sets  are  disjoint  by  construction.  We  will  refer  to  their 
union,  y.  as  the  combined  preserving  substitution.  P  is  found  by  unifying  homogeneous  terms  in  the 
appropriate  sub  theory,  and  the  preserving  substitution  is  combined  with  each  p  €  P  by  unification  of 
substitutions.  The  set  of  unifiers.  2,  is  returned  after  restricting  the  domain  to  the  variables  in  t  and  s. 
(Note:  for  readability,  we  have  extended  the  notation  for  restricting  substitutions  to  denote  restriction 
of  a  set  of  substitutions,  i.e.,  Z|v  =  {u|vl  o€I}.) 


CR-unity  =  proc  (/,  s:  term)  returns  (subst_set) 
case 

is_variable(t)  and  is_variahle{s)  ^ 

return({{t«-s}})  %  case  1 

is_variable(f)  and  ~is_variable(s)  =» 

return (CR-variable-unify(t,  s))  %  case  2 

is_variable(s)  and  -is_variable(/)  ^ 

retu  rn{CR-variable-unify{s,  t))  %  case  3 

t.head  *  s.head  ^ 

ff 

return  (0)  %  case  4 

t.head  =  s.head  =>  %  case  5 

V 

y  :  =  Preserve (t,  f )  U  Preserved,  s) 

P:  =  E[t  head]Unify{T'  S) 

2  :  =  U  map-unify(p,y) 

p€P 

return(S|r(f)ur(s)) 

end 

end  CR-unify 

CR-variable-unify  =  proc  (v:  variable,  s:  term)  returns  (subst_set) 
y  :  =  Preservers,  s) 
case 

v  £  T{s)  =*  %  case  A 

return  ({{v  «-s}}) 

v  €  T(s)  &  v  €  5(y)  =>  %  case  B 

P:  =  Els.heacl]-unify(v'® 

2 :  =  U  map-unify{p,  y) 
p€P 

return(2|{y}utls)) 

v  €  1%s)  A  v  6  5(y)  =>  %  case  C 

return(0) 

end 

end  CR-variable-unify 

Figure  2-3:  The  CR-unify  Procedure  for  Equational  Unification 


2.3  An  Example 


This  example  shows  unification  in  the  equational  theory,  E,  as  presented  in  Figure  2-1.  Let  the 
input  terms  be,  t  =  b  +  (x  •>■)  and  s  =  a  +  z .  Both  are  non-variable  terms  and  the  sub-theory  of  the  root 
operator  ( +  in  both  cases),  is  presented  by  Er  The  relevant  axioms  for  this  example  are  Ev  the  AC 
theory  for  + ,  E2  the  AC  theory  for  •  with  idempotent  constant  a ,  and  E4,  the  empty  theory  with 
uninterpreted  symbols  b ,/,  and  g . 

Calling  CR-unify(t,  s),  we  find  that  case  5  of  CR-unify  is  appropriate  for  two  terms  with  roots 
constrained  by  the  same  sub-theory.  Following  this  branch,  we  compute  the  homogeneous  terms,  f 
and  s.  The  set  of  constrained  symbols  for  E1  is  { +  }. 

f=v1  +  v2  s  =  v3  +  z, 

The  preserving  substitutions  are: 

Preserve (/,  f)  =  {v^*-b,  v2*-x*>}  Preserved,  s)  =  {v3«-a} 
and  the  combined  preserving  substitution  y  =  Preserve (1,  T)°Preserve(s,  s)  is: 

{v  1*-Z» ,  v2«-jr  *y,  v3«-a} 

The  homogeneous  terms  are  unified  in  the  sub-theory  E*.  the  AC  theory  for  +  AC  unifying  f  and 

v 

s  results  in  a  complete  set  of  AC-unifiers.  This  set  will  contain  two  unifiers  that  are  within  =  of: 

p  1  =  {VV1’Z4-V2> 

p2=  {vfv2,Z+vA). 

We  proceed  by  calling  map-unity  with  p  and  y.  (Both  p t  and  p2  will  be  considered  eventually  and 
the  choice  of  which  unifier  to  look  at  first  is  arbitrary.) 

map-unify(pv  y)  =  map  unify({v 3*-v y  z+v 2),{v ^b ,  v 2<-x  *y ,  v 3*-y} 

Recall  that  a  substitution  maps  each  variable  outside  its  domain  to  itself.  The  corresponding  pairs 


of  terms  are  thus: 

from  py 

fVi  =  v, 

PiVz=v2 

PlV3=V1 

from  y: 

y  z  =  z 

r: 

II 

O- 

yv2  =  x»y 

yv3  =  a 

In  this  example,  each  recursive  call  to  CR-unify  will  yield  a  singleton  set  of  unifiers,  although  this  will 
not  be  true  in  general.  (If,  at  any  point,  more  than  one  unifier  was  returned,  we  would  proceed  with 
each  in  depth-first  fashion.)  We  will  show  the  inputs  to  each  call  to  CR-unify,  the  resulting  unifier  set, 
and  the  effect  of  applying  the  unifier  on  the  pairs  of  terms.  The  pair  of  terms  for  z  is  unified  first, 
because  it  is  the  only  element  of  V1;  the  order  of  the  rest  of  the  calls  is  arbitrary. 


1  v  R  a"  :  J  returns  {{:->  2}} 


from  py  i  2 

from  y:  »  2 

2.  CR-unifyV'.  v ,)  returns  {{»■  ^b}) 

fromp(:  >2 

from  y:  v 

3.  CR-umty(b .  v  *_t  )  fails,  since  h  and  •  are  not  in  the  same  equivalence  class. 

This  ends  the  call  to  map  unify  for  py 

We  call  map  unify  again  this  time  with  p2  and  y. 

mapun  ty{p2.y)  =  map-unity({v  2,  :  «-i  ,}.{v  \'2*-x*y,  vfy] 


The  corresponding  pairs  of  terms  are: 


from  p2:  P 2z  =  vi 

P2V  i  = 

P2'-2=v2 

P2V3=V2 

from  y:  y:=:  y  v^-b 

The  following  sequence  of  calls  to  CR  unity  results. 

Yv2  =  x  •) 

1.  CR-unily(z,  vt)  returns  {{r •*-v  t}} 

from  p^:  v, 

V1 

V2 

v2 

from  y:  v 

b 

x*y 

a 

2.  CR-unify(b ,  v  t)  returns  {{ v  1«-6 }} 

fromp^  b 

b 

V2 

V  2 

from  y:  b 

b 

x  *y 

a 

3.  CR-unify(x  •}’,  r  2)  returns  {{ v  2*-jc  •>■}} 

fromp^  b 

b 

x  *y 

x-y 

from  y:  b 

b 

x  *y 

a 

4.  CR-unity(x  *y,  a)  calls  E2-unity  since  both  terms  are  homogeneous  in  {•,  r;}.  This  results  in 
the  singleton  set  of  unifiers  {{x  j  }} 

fromp^  b  b  a*a  a*a 


b  x  •)  a 

b  b  b 

b  x  *y  a 
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from  y:  b  b  a*a  a 

Composing  the  unifiers  from  steps  1  through  4,  we  get  the  substitution: 

{j x*-a  ,y*-a,z*-b,v  y*-b,v2*-a  *0} 
or  restricting  to  the  variables  in  t  and  s: 

{jr  *-a  ,y*-a,z*-b}. 

We  check  that  this  is  indeed  a  unifier  of  t  and  s  by  applying  and  testing  for  E-equality. 

b  +  (a  *a)  =  a  +  b 


2.4  Difficulties  in  Exending  CR-unify 

The  restrictions  of  confinement  and  regularity  were  carefully  chosen.  Two  arguments  will  make 
this  point  clear.  Moot  importantly,  we  will  prove  in  Chapter  3  that  the  restrictions  are  sufficient,  i.e., 
that  CR-unify  is  consistent,  complete,  and  terminating  for  the  confined  regular  theories.  This  section 
is  devoted  to  showing  the  necessity  of  our  restrictions.  We  will  show  that  CR-unify  is  not  complete  for 
theories  that  are  either  unconfined  or  non -regular.  A  great  deal  of  effort  was  put  into  the  design  of  the 
CR-unify  algorithm,  and  a  number  modifications  attempted  for  the  purpose  of  weakening  the  restric¬ 
tions  of  confinement  and  regularity.  Although  none  of  these  modifications  solve  the  more  general 
problems,  the  incorrectness  was  not  always  obvious.  We  include  some  examples  in  this  section  that 
point  to  problems  in  the  modified  algorithms. 

The  modified  algorithms  will  be  within  the  framework  of  our  approach,  i.e.,  E*  will  have  a  par¬ 
titioned  presentation  defining  the  sub-theories  and  these  sub-theories  will  be  assumed  to  have  com¬ 
plete  and  terminating  unification  algorithms.  Since  the  problem  with  CR-unify  is  completeness,  the 
modifications  will  involve  changing  a  failure  case  to  a  non-failure  case. 

The  first  example  shows  that  CR-unify  is  not  complete  for  unconfined  theories  because  case  4  of 
the  procedure  signals  failure  when  the  head  symbols  of  the  two  terms  are  constrained  by  different 
sub-theories. 


Example  1. 

E  ,:/(*)  =  Jr 
E2.g(jc)  *  x 


t  =  /(*) 


s  =  gtk) 


Case  4  of  CR-unify  applies  because  f*g,  so  CR-unify  returns  the  empty  set.  However,  the  substitu- 
tion  {.<  *-y  }  is  a  unifier  of  t  and  s. 

Note  that  if  we  replace  case  4  of  CR-unify  with  two  recursive  steps,  one  for  each  sub-theorv,  the 
problem  in  Example  1  will  cause  the  procedure  to  loop. 


s-y: 


•Vv»J 


V 


>  vi 
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Example  2. 

^  =  /(-*)  S,  =  », 

Y,  =  {v,^^)} 

f2  =  2  ,  $,  =  «(V) 

Y2  =  {v2-/(*)} 

E^-unify(f(x),  v  returns  the  single  unifier 
P  =  (v,*-/(Jr)}. 

Map-unify{y  v  p)  call  CR-unify  on  : 

t' =  gb>)  s'  =/(x) 

which  is  the  original  problem. 

The  procedure  will  similarly  loop  on  the  second  recursive  call  that  starts  with  E2-unifying  f  and  fg. 

Example  3  is  another  example  in  which  a  unifier  exists,  but  the  CR-unify  algorithm  will  fail  to  find  it. 
In  this  example  case  C  of  CR-variable-unify  is  the  incomplete  step. 

Example  3. 

E1:/(0,x)  =  x  t  =  f(x ,  g(y ,  z))  s  =  z 

E2:  g(1,  Jf)  =  x 

This  example  fails  in  case  C  of  CR-variable-unify,  because  z  occurs  below  an  inhomogeneous  sub¬ 
term  of  s,  although  the  substitution  {x*-0,y*-2)  is  a  unifier  of  1  and  s. 

Again,  if  we  replace  case  C  of  CR-variable-unify  with  a  recursive  step  as  in  case  B  of 
CR-variable-unify  and  case  5  of  CR-unify,  the  procedure  will  loop. 

Example  4. 

?=/(*.»,)  s  =  z 

Y  =  (v^gO'.z)} 

E ^-unify(f(x,  v^,  z)  returns  a  single  unifier 
P  =  {*♦/(*,  •',)}• 

Map-unify{y,  p)  will  call  CR-unify  on: 

t)  =  v1  s1  =  gly,  z)and 

t2  =/(*,  v,)  s2  =  z. 

CR-unify(v  ,  g(>’.  z))  returns  a  single  unifier: 

<r1  =  {v^S(y<  z)}. 

Applying  this  to  the  second  pair  of  terms  gives: 

<V2  =/(■*.  £0\z))  <ts2  =  z, 
which  is  the  original  problem. 
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The  on  '  cahon  problem  in  Examples  3  and  4  are  correctly  solved  by  replacing  »  with  a  constant 
or  b>  E  matching  :  by  <  as  suggested  by  [Tiden  85]  However,  Example  5  shows  that  these  methods 
do  not  work  in  general. 

Example  5 

E,:/(v.  x )  =  v  t  -  h\J\x,y))  s  =  x 

E2:MM-x))  =  h{x) 

f=  *(v  t)  S  =  c, 

where  (■  t  is  a  new  constant. 

The  terms  A  ( v  and  c ,  are  not  E2  unifiable.  so  this  procedure  will  fail  to  find  the  unifier,  {x *-h(v2), 

}  -A(ra)}. 

Intuitively,  the  problem  illustrated  in  Examples  4  and  5  is  that  the  evidence  of  z  in  t  is  lost  when  t  is 
homogenized,  tf.  instead  of  replacing  *•(.» ,  :)  with  a  new  variable,  we  replace  it  with  then  the 
algorithm  finds  the  correct  unifier.  Unfortunately,  this  technique  will  not  work  in  general  for  non¬ 
regular  theories  as  shown  in  Example  6  Like  Example  5.  Example  6  is  also  a  counter-example  to  the 
idea  of  replacing  a  variable  to  be  unified  with  a  constant.  In  Example  5  the  theory  is  regular  by  not 
confined,  and  in  Example  6  the  theory  is  confined  but  not  regular. 

Example  6. 

£,:/(.»,  0)  =  0  t  =  h(f(x,y))  s  =  y 

E2  0  (h  unconstrained) 

I'  =  h{y)  s  =  y 

where  t'  is  the  homogenized  form  of  f,  but  y  is  used  in  place  of  a 
new  variable  because  it  appears  in  /(.» ,  \ )  and  it  is  the  term  with 
which  t  is  being  unified. 

Unifying  h(j  )  and>  in  the  empty  theory  will  fail,  so  the  E-unifier,  «-0},  will  not  be  found. 

Examples  corresponding  to  3  and  4  also  exist  for  non-regular  theories.  The  unification  problem  in 
Example  6  shows  that  case  C  of  CR-variable-unify  is  not  complete,  and  that  modifying  case  C  to 
perform  recursion  will  make  the  process  loop. 

The  examples  themselves  are  interesting,  but  a  general  conck'sion  can  also  be  drawn,  namely, 
that  the  properties  of  completeness  and  termination  conflict.  These  examples  also  motivate  the 
careful  proofs  in  Chapter  3,  since  some  of  the  completeness  problems  would  not  arise  in  the  intuitive 
arguments  of  the  lemmas,  but  would  arise  in  the  careful  inductive  proofs  of  the  theorems. 


We  also  note  that  the  trivial  failure  cases  in  classical  unification,  i.e.,  clashes  and  cycles,  are  no 
longer  failure  cases  in  many  equational  unification  algorithms.  While  the  relaxation  of  clash  detection 
has  been  treated  in  the  literature,  the  difficulty  of  detecting  cycles  in  theories  such  as  idempotence, 
where  unifying  f(x ,  x )  and  x  does  not  cause  a  cycle,  has  been  underestimated. 


v'-*-  \ 
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Chapter  Th  ree 
Proof  of  Total  Correctness 


This  chapter  presents  the  proof  of  total  correctness  for  the  CR-unify  procedure.  The  correctness 
proof  is  divided  into  a  consistency  theorem,  a  completeness  theorem,  and  a  termination  theorem. 
The  consistency  and  completeness  theorems,  given  in  Sections  3.2  and  3.3,  respectively,  are  proved 
by  induction  on  the  depth  of  recursion  and  are  therefore  dependent  on  the  termination  theorem  in 
Section  3.4.  The  proof  of  termination  is  by  induction  on  a  noetherian  ordering  on  pairs  of  input  terms; 
it  is  a  generalization  of  Fages'  proof  of  termination  for  AC-unification  [Fages  84].  Before  presenting 
the  three  correctness  results,  we  will  begin  in  Section  3.1  with  some  definitions  for  the  proofs  and 
some  important  theorems  having  to  do  with  our  restriction  to  confined  regular  theories. 

The  correctness  proofs  depend  on  CR-unify  being  protective.  For  »he  sake  of  thoroughness,  we 
present  a  more  careful  description  of  the  naming  of  new  variables  in  Appendix  A,  along  with  a 
discussion  of  protection.  Throughout  this  chapter  we  will  assume  each  new  variable  is  one  that  has 
not  occurred  previously  and  thus,  for  example,  the  new  variables  used  in  forming  f  and  s  in  CR-unify 
are  disjoint  from  new  variables  generated  within  an  invocation  of  a  sub  theory  unification  algorithm. 
The  formal  discussion  of  protection  and  naming  of  new  variables  is  relegated  to  an  appendix  because 
it  is  more  technical  than  interesting. 

3.1  An  Overview 

Throughout  this  chapter  we  will  assume  a  given  set  of  equations,  E,  defining  a  strictly  consistent 

equational  theory.  E*  is  partitioned  by  w  =  {Ev  E2 . En),  which  also  defines  a  partition  on  the 

signature,  F,  of  E*.  For  each  sub  theory  there  exists  a  complete,  consistent,  protective,  terminating 
unification  algorithm.  We  make  no  other  global  assumptions  on  E*  and  will  be  careful  to  point  out 
cases  in  which  E*  is  assumed  to  be  confined  or  regular. 

Each  of  the  three  main  theorems,  consistency,  completeness,  and  termination,  is  proved  by  induc¬ 
tion.  We  could  imagine  combining  the  three  proofs  into  a  single  inductive  proof  of  total  correctness. 
This  is  not  necessary  because  the  termination  proof  does  not  depend  on  the  recursive  calls  returning 
complete  sets  of  unifiers,  but  depends  instead  only  on  the  termination  of  the  recursive  call  and  some 


tec‘‘i  cii'  p'cpert.es  cf  the  returned  substitutions  However,  the  proposed  single  correctness  proof 
will  provide  a  use'ul  structure  for  demonstrating  the  overall  structure  of  the  three  proofs. 

First  at  each  step  some  progress  must  be  made  towards  finding  an  answer  so  that  the  next 
unification  problem  to  be  solved  is,  by  some  measure,  easier  than  the  current  one;  this  property, 
along  with  the  fact  that  our  measure  of  complexity  does  not  decrease  infinitely,  implies  termination. 
Second,  each  step  must  generates  some  piece  of  what  could  turn  out  to  be  a  legitimate  unifier,  i.e., 
we  cannot  generate  any  incorrect  pieces  This  is  the  criterion  for  consistency.  Finally,  every  pos¬ 
sibility  leading  to  a  good  answer  must  be  considered  The  set  of  partial  answers  generated  at  each 
step  must  be  complete  in  the  sense  that  every  most  general  unifier  can  be  formed  from  one  of  the 
elements  of  the  set.  This  property  will  give  us  completeness.  By  separating  these  three  independent 
properties  into  three  theorems  we  are  able  to  focus  our  attention  on  one  of  the  correctness 
properties  at  a  time. 

In  proving  the  inductive  hypotheses  in  the  consistency  and  completeness  proof,  we  need  to  show 
that  the  basic  approach  to  breaking  down  the  problem  and  building  up  the  solutions  is  correct. 
Intuitively,  we  would  like  to  show  the  diagram  of  Figure  3  1  is  correct.  Although  we  will  not  prove  this 
diagram  commutes  it  will  help  motivate  the  technical  lemmas  in  the  proof  of  completeness,  and  is 
very  close  to  the  correctness  diagram  that  will  be  proved.  There  are  two  levels  of  detail  at  which  the 
diagram  should  be  viewed.  Consider  first  only  the  mappings,  i.e.,  the  labels  on  edges,  and  note  that 
the  right  and  left  halves  are  mirror  images,  each  showing  o°p=o°y.  Now  consider  the  labels  on 
vertices,  t  and  s  are  homogeneous  terms  and  t  and  s  are  some  possibly  inhomogeneous  instances  of 
these  terms,  p  is  an  E(-unifier  of  f  and  s,  and  a  is  an  E-unifier  of  f  and  s. 


then  for  any  sub  theory  unifier,  p,  of  the  homogeneous  terms,  alt  E-unifiers  of  p  and  y  are  E-unifiers  of 
i  and  s.  The  completeness  argument  says  roughly  that  for  any  E-unifier,  a,  there  exists  some  sub¬ 
theory  unifier,  p,  of  the  homogeneous  terms,  such  that  a  is  an  E-unifier  of  p  and  y. 

3.1.1  Definitions  for  Terms  in  E* 

Because  E*  is  partitioned,  the  terms  in  ‘J/  =  have  some  interesting  properties  related  to  the  par¬ 
titioning.  This  section  establishes  some  new  concepts  for  describing  these  properties. 

There  is  a  partial  order  on  terms  defined  as  t<s  if  and  only  if  t  is  a  subterm  of  s.  We  use  this  to 
define  another  partial  order  on  terms  that  is  contained  in  but  takes  into  account  the  equivalence 
classes  of  function  symbols  defined  by  it. 

If  o  is  a  proper  occurrence,  let  prefix[ o)  be  the  string  o  minus  the  last  number,  i.e., 
for  all  o*£,  there  exists  i  such  that  o  =  prefix( o).i.  In  the  tree  representation  of  terms,  prehx(o) 
indexes  the  parent  of  the  node  at  occurrence  o. 

Definition.  Given  a  partition,  =,  on  F,  a  term,  t,  and  an  occurrence,  o  €  0(f),  o  is  said  to  be 

W 

significant  in  t  if  and  only  if  either: 
l.o  =  £,  in  which  case  t/o  *  t, 

2.  or,  o  is  not  strict  in  f,  i.e.,  t/o  is  a  variable, 

3.  or,  [t/o).head*  (t/prefix(o)).head. 

V 

In  other  words,  an  occurrence  in  a  term  l,  is  significant  if  the  subterm  at  the  occurrence  has  a  head 
symbol  in  a  different  partition  of  F  than  the  symbol  it  occurs  under.  In  addition,  the  empty  occurrence 
and  all  variable  occurrences  in  a  term  are  significant. 

Definition.  The  term  s  is  a  significant  subterm  of  t,  denoted  s^y  t ,  if  and  only  if: 

3  o£0{f)  such  that  t/o  =  s  and  o  is  a  significant  occurrence  in  t. 

Note  that  s  may  appear  at  both  significant  and  insignificant  occurrences  within  f,  but  if  at  least  one 
occurrence  is  significant,  then  s  is  significant  in  f.  If  s  is  proper  in  t  as  well  as  significant,  we  write 

s-<yf. 

Let  F  be  partitioned  in  to  {  + ),  {•,  a),  {•}  and  {b,f,  g}  as  in  Figure  2-1.  Figure  3-2  shows  a  term 
with  all  of  its  significant  subterms  outlined.  As  noted,  a  is  significant  in  its  first  and  third  occurrences, 
but  not  in  its  second  occurrence. 

A  suggestion  of  the  relevance  of  significant  subterms  comes  from  considering  an  arbitrary  term,  t, 


Figure  3-2:  A  Term  and  its  Significant  Subterms 


its  homogeneous  form,  f,  and  the  preserving  substitution,  y  =  Preserve^,  f).  Ail  terms  in  are 
significant  in  t.  The  only  significant  subterms  of  the  homogeneous  term,  f,  are  f  itself  and  the 
variables  in  the  f. 

Next  we  define  the  notion  of  the  parents  of  a  term  t  in  s,  denoted  Parents(t ,  s),  where  a  parent  is  an 
operator  symbol  in  s  having  t  as  an  argument.  A  special  null  operator,  n ,  is  included  denote  the 
parent  of  a  term  within  itself. 

Definition.  The  parents  of  t  in  s,  written  Parents (/,  s)  are: 

1.  if  t  =  s,  then  Parents[t,s)  =  {/»}, 

2.  otherwise,  Parents{t,  s )  =  {f\  3 /(sr...,sn)^s  &  (3i,  s,  =  f)}. 

This  definition  is  extended  to  equivalence  classes  of  F  by  considering  only  those  classes  with  a 
representative  in  Parents(t,  s): 

Definition.  The  parent  sets  of  t  in  s,  written  ParSets(t,  s)  are: 

1.  Iff  =  s, then  ParSefs(f.s)  =  {{«}}, 

2.  otherwise  ParSets(t,  s)  =  {[/l  lf€Parents(t,s)}, 

Taking  t  to  be  the  term  in  Figure  3-2  and  using  the  same  partitioning  on  F  ({  +  },  {•,  a},  {•}  and  {b,f, 
S  }),  we  get  the  following  values  of  parents  and  parent  sets  in  t. 

Parents(x ,t)  =  {*)  ParSets{x,t)  =  {{♦,  a}) 


Parents  \  f)  =  {j;} 
Paremsi: .  t)  =  {} 

pj-t> -7, /)  =  [t;  } 


ParSeH>(u  I)  =  {{•</},{/>.  /,  i’ }} 
ParSefs(>  r)  =  {{/>,/, i*}} 
ParSets(:,  t)  =  {{}} 

ParSefs(f,  /)  =  {{«)} 


3.1.2  Properties  of  Confined  and  Regular  Theories 

As  stated  in  Section  2.1.3.  we  will  limit  the  set  of  theories  in  E*  by  two  syntactic  restrictions: 
confinement  and  regularity.  In  this  section  we  give  some  of  the  lemmas  pertaining  to  confined  and 
regular  theories,  characterizing  the  ways  in  which  unification  is  simplified  in  these  theories.  For  both 
confinement  and  regularity,  we  first  prove  that  the  restriction  on  the  axioms  is  equivalent  to  the  same 
restriction  on  the  theory. 

3. 1.2.1  Confined  Theories 

Recall  the  definition  of  confined  theories  limits  the  equations!  theories  to  those  containing  no 
equations  with  a  variable  equal  to  a  non  variable.  Lemma  1  shows  that  we  eliminate  exactly  those 
theories  that  are  unconfined  by  eliminating  those  with  unconfined  presentations.  Lemma  2  will  show 
that  confined  theories  contain  only  equations  with  head  symbols  constrained  by  the  same  Ejt  and 
Theorem  I  relates  this  property  to  the  unification  problem. 

Lemma  1  and  a  number  of  other  proofs  involving  equational  theories  will  be  done  by  induction  on 
the  length  of  proof.  By  the  completeness  of  the  inference  rules  in  Section  1 .4.2,  we  know  that  if  f  |S 
then  there  exists  a  finite  proof  of  f  =s,  starting  from  the  axioms  in  E  and  using  only  the  five  listed 
inference  rules.  We  use  this  fact  in  the  proof  of  Lemma  1.  for  example,  where  we  want  to  show  there 
are  no  equations  in  E*  that  are  unconfined  if  there  are  no  equations  in  E  that  are  unconfined.  To 
show  any  equation  /  =s  is  confined,  we  need  to  show  that  the  inference  rules  only  prove  confined 
equations,  when  they  start  with  confined  axioms.  If  we  measure  the  proof  as  the  number  of  steps, 
where  each  step  requires  application  of  one  rule  of  inference,  then  the  basis  case  is  a  proof  of  length 
zero.  If  the  equation  is  proved  without  any  inference  rules,  then  it  must  be  in  the  set  of  axioms. 


The  inductive  step  proves  that  a  proof  of  length  n  +  1  gives  only  confined  equations,  assuming 
proofs  of  length  n  give  only  confined  equations.  Proving  the  induction  step  is  done  by  examining 
each  possible  inference  rule  that  could  take  us  from  the  n,h  to  the  n  +  1st  step.  If  the  transitive  rule 
was  the  last  step  in  a  proof  of  t  =s,  then  there  must  have  been  some  other  term,  r,  such  that  t  =  r  and 
r  =s  were  proved  in  n  steps  or  less.  We  can  therefore  assume  the  inductive  hypothesis  on  r  =r  and 
r  =s,  which  says  both  equations  are  confined.  Combining  this  with  the  fact  that  the  term  r  occurs  in 
both  equations,  we  know  that  either  t,  s,  and  r  are  all  variables,  or  t,  s,  and  r  are  all  non-variables;  in 
either  case  the  equation  t  =s  will  be  confined,  and  the  induction  step  for  transitivity  is  proved. 


Lemma  1 :  E  is  confined  if  and  only  if  E*  is  confined. 

Proof.  One  direction  (<=)  is  obvious,  the  other  will  be  proved  by  induction  on  the  length  of  proof  in 
E*.  Let  E  be  confined  and  show  that  any  equation,  t  =s,  in  E*  must  be  confined. 

1 .  Basis:  f  =s  is  an  axiom  in  E,  so  t  =s  is  confined  because  E  is  confined. 

E  E 

2.  Inductive  step:  If  t  =s  has  a  proof  of  length  n,  then  consider  each  possible  inference  rule  for 
the  last  step  in  the  proof: 

A.  Reflexive:  Trivial. 

B  Symmetric  and  transitive:  Follow  directly  from  the  induction  hypothesis. 

C.  Equality:  Yields  only  pairs  of  non  variable  terms. 

D.  Instantiation:  From  t'  =s'  deduce  at'  =as\  where  t  is  at'  and  s  is  as'  and  t'  =s'  has  a  proof 

EE  E 

of  length  n-1  or  less.  By  the  induction  hypothesis,  either  t'  and  s'  are  both  non- variables 
or  both  variables.  If  they  are  both  non-variables,  t  and  s  will  be  also.  If  they  are  both 
variables,  then  they  are  the  same  variable  (by  consistency  of  E*),  so  t  and  s  are  either 
both  the  same  variable  or  both  the  same  non-variable  term. 

□ 

One  condition  used  in  our  proof  of  correctness  of  CR-unify  is  that  E*  is  a  confined  theory.  Lemma 
2  shows  that  in  confined  theories  there  are  no  equations  whose  right  and  left  head  symbols  are 
constrained  by  different  sub-theories.  This  gives  us  the  invariant,  stated  in  Theorem  I,  that  two 
non-variable  terms  whose  heads  symbols  are  constrained  by  different  sub-theories  (from  v)  of  E*  are 
not  E-unifiable.  We  make  use  of  this  fact  in  case  4  of  CR-unify  where  an  empty  set  of  unifiers  is 
returned. 

Lemma  2:  If  E*  is  a  confined  theory,  then  t  =s  =>  t.head  =s.head. 

E  v 

Proof.  By  definition,  v  presents  the  same  theory  as  E,  although  the  two  may  contain  different 
axioms.  We  will  make  use  of  this  fact  by  performing  induction  on  the  length  of  proof  that  t  =s  starting 
from  the  axioms  in  w,  rather  than  the  axioms  in  E. 

1.  Basis:  If  f=s  is  an  axiom  in  some  EjEw,  then  because  v  defines  the  partition  on  F, 
t.head  =s.head. 

2.  Inductive  step:  Consider  each  possible  inference  rule  for  the  last  step  in  the  proof. 

A.  Reflexive:  Trivial. 

B.  Symmetric  and  Transitive;  Follow  directly  from  the  induction  hypothesis. 


->  ->  -j  -.w  --.  v 


C.  Equality:  Yields  only  terms  with  the  same  head  symbol. 

D.  Instantiation:  From  t'  =s'  deduce  at'  =as',  where  at'  =  t  and  as'  =  s. 

E  E 

i)  If  t'  and  s'  are  non- variable  terms,  then  by  the  induction  hypothesis  t'.head  = s'. head 
and  hence  t.head  =$.head. 

9 

ii)  If  either  t'  or  s'  is  a  variable,  then  they  are  both  variables  and  both  the  same  variable. 
Therefore  t  and  s  are  the  same  term  and  the  property  holds  trivially. 

□ 


Theorem  I:  If  E*  is  a  confined  equational  theory,  and  t  and  s  are  non-variable  terms  such  that 
t.head*-s.head,  then  t  and  s  are  not  E-unifiable. 

tr 

Proof.  Any  instance  of  t  and  s  will  have  the  same  non-equivalent  head  symbols,  and  by  Lemma 
2  these  two  terms  cannot  be  equal  in  E* .  □ 


3.1 .2.2  Regular  Theories 

The  restriction  to  regular  theories  is  needed  for  case  C  of  CR-variable-unify,  a  failure  case,  f.e., 
because  C  is  a  failure  case,  CR-unify  is  not  complete  for  all  confined  theories,  but  regularity  along 
with  confinement  of  E*  is  sufficient  for  completeness.  (See  Section  2.4  for  an  example  showing 
incompleteness  of  CR-unify  for  unconfined  theory  and  a  non-regular  theory.)  The  key  result  of  this 
section  is  Theorem  II.  It  states  that  a  variable  is  not  unifiable  with  a  term,  f,  containing  that  variable  if 
the  variable  occurs  below  the  homogeneous  part  of  the  term,  i.e.,  at  an  occurrence  not  in  f.  Lemma 
4  is  used  to  prove  Lemma  5  which  in  turn  is  used  to  prove  Lemma  6. 


Lemma  3:  E  is  regular  if  and  only  if  E*  is  regular. 

Proof.  Again,  one  direction  (<=)  is  obvious,  the  other  will  be  proved  by  induction  on  the  length  of 
proof  in  E*.  Let  E  be  regular  and  show  that  any  equation,  t  =s,  in  E*  must  be  regular. 

1 .  Basis:  t  |S  as  an  axiom  in  E,  so  t  |S  is  regular  by  the  hypothesis  on  E. 

2.  Inductive  step:  Consider  each  possible  inference  rule  for  the  last  step  in  the  proof: 

A.  Reflexive:  Trivial. 


B.  Symmetric  and  transitive:  Follow  directly  from  the  induction  hypothesis. 

C.  Equality:  From  /.  =sr  1  SiSn,  deduce f(t1 . fn)|A(si . sn)>  where  t  is  f(ty...,tn)  and  s  is 

/(s1 . $n).  Since  t  contains  the  union  of  all  variables  in  the  t’ s  and  s  contains  those  in 

the  s/s,  these  two  sets  are  equal  because  they  are  each  the  union  of  n  pairwise  equal 
sets. 
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D  Instant. ation  From  =s‘  deduce  af'=os’.  where  t  is  at  and  s  is  os'.  By  the  induction 
E  E 

hypothesis.  i(r )  =  Ifs  ).  so  consider  each  i  in  this  set.  If  r  is  in  the  domain  of  o,  then 
the  variables  m  a .  will  occur  in  both  t  and  s  and  otherwise  i  itself  will  occur  in  both  t 
and  s. 

D 

The  property  gained  by  restricting  E*  to  a  regular  theory  can  be  seen  by  considering  the  set  of  all 
significant  subterms  of  a  pair  of  equivalent  terms.  Lemmas  4  and  5  characterize  this  property 
precisely:  Lemma  4  states  that  any  variable  occurring  in  two  equivalent  terms,  will  occur  under  the 
same  set  of  parent  sets,  i.e .  equivalence  classes  of  parents;  Lemma  5  shows  that  given  two  con¬ 
gruent  terms,  the  sets  of  all  significant  subterms  of  the  terms  are  equal  modulo  E. 

Lemma  4:  If  E’  is  confined  and  regular,  and  t  =s.  then  for  all  \  ParSets(\\t)  =  ParSets(v ,  s). 
Proof.  By  induction  on  the  length  of  proof  of  t  =s  starting  from  the  axioms  in  v. 

1  Basis  If  t  =  s  is  an  axiom  in  some  E^w.  then  t  and  s  are  homogeneous  in  the  constrained 
symbols  of  E*.  and  there  are  three  cases. 

A  If  i  (£1(0  then  by  regularity  of  E.  i  £t]s),  so  ParSets(v,  t)  =  ParSets(v ,  s)  =  0. 

B.  i  €T(f)  and  t  is  a  non-variable  term,  then  s  is  a  non  variable  term  and  by  regularity  of  E*, 
v€f{s).  Therefore,  ParSefsfv ,  f)  =  ParSefsfv ,  s),  since  equations  in  w  are  by  definition 
homogeneous. 

C.  If  t  is  a  variable  and  v  =  f,  then  s  is  the  same  variable.  Therefore,  ParSets(v,  t )  = 
ParSets(v ,  s)  =  {{//}}. 

2.  Inductive  step:  Consider  each  possible  inference  rule  for  the  last  step  in  the  proof:  use  one 
of  the  following  inference  rules. 

A.  Reflexive,  symmetric,  and  transitive:  Obvious. 

B.  Equality:  From  f(  =s(,  1  <  i  <  n,  deduce  f(ty  -.,tn)  = f(sr  --,sn),  where  t  is  /(f, . fn)  and  s  is 

/(s, . sn).  If  t  and  s(  are  non-variables,  then  by  the  induction  hypothesis,  all  variables  in 

T  will  occur  under  the  same  parent  sets  in  s(  and  vice  versa.  If  t.  and  s(  are  variables, 
then  they  are  the  same  variable  so  both  will  occur  under  the  equivalence  class,  [/]. 

C.  Instantiation:  From  t'  =s  deduce  at'  =as',  where  at'  =  t  and  os'  =  s.  Consider  the  follow- 

E  E 

ing  two  cases  on  the  occurrence  of  v  in  at'. 

i)  If  the  occurrence  of  v  in  at'  is  also  in  t',  then  by  the  induction  hypothesis  there  is 
some  occurrence  in  s'  under  the  same  parent  set.  The  same  is  true  with  t'  and  s' 
reversed. 
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ii)  If  the  occurrence  of  v  is  in  at'  and  not  in  t‘,  then  r  must  be  an  element  of  3(o)  or, 
restated,  there  exists  v'€5(cr)  such  that  r  €  Tfav')  and  v  O .  By  the  induction 
hypothesis,  ParSets( t t ')  =  ParSets(v\  s'),  and  if  ov'  is  simply  the  variable,  r,  then 
all  occurrences  of  »■  will  correspond  to  an  occurrence  of  v*  in  both  t  and  s.  If  a  >’  is  a 
non  -variable  term  containing  v ,  then  by  regularity,  the  entire  subterm  (o  v ')  will  occur 
in  both  t  and  s  and  thus  v  will  occur  under  the  same  operators  in  both. 


Lemma  5  states  that  if  t  =s,  then  there  is  a  one-to-one  correspondence  modulo  E  between  the 
E 

significant  subterms  of  t  and  the  significant  subterms  of  s.  The  main  purpose  of  Lemma  5  is  to  prove 
Lemma  6,  although  the  result  is  interesting  in  its  own  right  as  a  property  of  confined  regular  theories. 

Lemma  5:  If  E*  is  confined  and  regular,  and  t=s,  then  for  all  t '^yt  there  exists  s  such  that 

.»  * 

t  =s . 

E 

Proof.  By  induction  on  the  length  of  proof  that  t  =  s,  starting  from  the  axioms  in  w. 

1  Basis:  If  t=s  is  in  some  E  Ew,  then  t  and  s  are  homogeneous  with  respect  to  the  symbols 
E  1 

constrained  by  E*.  Therefore,  the  only  significant  subterms  in  t  are  t  itself  and  the  variables 
in  f \t).  If  /'  =  t,  we  can  take  s'  to  be  s,  and  if  t'  is  a  variable  in  t,  we  can  take  s'  to  be  the  same 
variable,  which  must  an  element  of  TIs)  by  regularity  of  E*. 

2.  Inductive  step:  Consider  each  possible  inference  rule  for  the  last  step  in  the  proof. 

A.  Reflexive,  symmetric,  and  transitive:  Obvious. 

B.  Equality:  From  1  <i<n,  deduce Aty...,tn)=f(sy...,sn),  where  t  is/ff, . fn)  and  s  is 

/(s, . sn).  By  the  induction  hypothesis,  for  every  fXyf.  there  exists  s'r<j-s)  such  that 

t'=s'.  Furthermore,  by  Lemma  2,  t.head  =  s  .head,  so  t.  will  be  significant  in  t  if  and  only 

i  e  i  1  w  1  1 

if  si  is  significant  in  s. 

C.  Instantiation:  From  t1  =s,  deduce  ^cts^  where  1  is  af1  and  s  is  as 1.  Consider  t 
such  that  ofj/o  =  f'. 

i)  If  o  is  a  strict  occurrence  of  Off,),  then  by  the  induction  hypothesis  on  n,  there  exists 

o'€0(s})  such  that  t/o  =s/o'.  Applying  ct  to  these  two  equivalent  subterms  we  get 

at  /o  =  as./ o',  but  <jf/o  is  t'  so  we  can  take  s'  to  be  as./ o'. 

1  £  1  •  1 

ii)  If  o  is  a  variable  occurrence,  then  f,/o  is  variable,  call  it  v.  By  Lemma  4, 
ParSets(v,  /,)  =  ParSets(  v ,  s,).  Therefore,  t'  will  itself  occur  significantly  in  s  if  and 
only  if  it  does  in  1. 


Vv'-.-l'Vv 
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iii)  If  o€0(t  ),  then  t'  is  a  proper  significant  subterm  of  or  for  some  r€fJ(a).  By 
regularity  of  E*,  v €T^f  )  =>  vCY^),  so  av  will  occur  in  s  =  as1f  and  hence  t'  will 
occur  significantly  in  s. 

□ 


The  above  lemmas  give  some  general  properties  about  confined  regular  theories  More  specific 
to  our  purposes  is  the  following  lemma,  which  gives  a  sufficient  condition  under  which  terms  are  not 
equal  in  the  theory.  Lemma  6  states  that  no  term  is  equal  modulo  E  to  any  subterm  of  a  proper 
significant  subterm  of  itself  and  is  proved  by  induction  on  the  structure  of  terms,  i.e.,  using  the 
subterm  ordering.  Theorem  II  relates  the  equality  problem  back  to  the  the  unification  problem. 

Lemma  6:  If  E*  is  confined  and  regular,  and  t,  s,  and  r  are  terms  such  that  f^<r^<f  s,  then  t*s. 

Proof.  By  structural  induction  on  s. 

1 .  Basis:  If  s  is  a  variable  or  constant,  then  / r-<^s,  so  the  hypothesis  is  vacuously  true. 

2.  Inductive  step:  If  s  is  a  non-variable,  non-constant  term,  then  there  are  three  cases  to 
consider  with  respect  to  r. 

A.  If  t  is  a  variable,  then  r*s  by  the  confinement  property  of  E*. 

B.  If  t  is  a  non-variable  term  such  that  t.head*s.head  then  t*s  by  Lemma  2. 

«r  E 

C.  If  t  is  a  non-variable  term  such  that  t.head  =s.head,  then  assume  t  =s  and  derive  a 

«  E 

contradiction. 

3  s'  such  that  t-<s'-<*s  and  s'.head*s.head.  (The  existence  of  the  significant  subterm, 

J  ir 

r ,  between  t  and  s  implies  the  existence  of  s'  with  inequivalent  head.) 

Since  t.head*  s'  .head,  t-<s'-<^s. 

By  Lemma  5,  t  =s  and  s'^js  implies  there  is  some  t'^yt  such  that  t'  =s'.  - 
From  Lemma  2,  t'. head  =  s'. head. 

Therefore,  t'.head*t.head ,  and,  again,  t'  must  be  is  proper  in  t. 

So  far  we  have  f'-<yf-<s'-<ys. 

Apply  the  induction  hypothesis  to  s',  using  t'  for  f,  s'  for  s'  and  t'  for  r,  respectively. 

By  the  induction  hypothesis,  r'^s',  but  this  is  a  contradiction,  since  t'  was  chosen  such 

thatf'=s'. 

E 

□ 
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The  ma  n  resj't  (or  confined  regular  theories  is  in  Theorem  II.  which  shows  that  case  C  of 
CR  wa'dO'f  „--i should  be  a  failure  case. 

Theorem  II:  If  E  is  confined  and  regular  and  r  and  s  are  non-variable  terms  such  that  r-<^s  and 
i  CiH  then  i  and  s  are  not  unifiable. 

Proof.  If  a  is  unifier  of  i  and  s,  then: 

<n  =as 
E 

and.  furthermore,  the  following  property  holds: 

<jv  ^<or-<f1. 

This  contradicts  Lemma  6.  so  no  such  a  can  exist.  □ 

Note  that  the  existence  of  the  significant  subterm,  r,  in  both  Lemma  6  and  Theorem  II  is  necessary. 
Without  r  the  Theorem  would  state  that  no  variable  is  unifiable  (in  any  confined  regular  theory)  with  a 
term  that  contains  it  A  simple  counter  example  to  this  stronger  statement  is  the  pair  of  terms /( x) 
and  i  m  the  the.ir,  prevented  by  E  =  {/(,;)=<;}:  the  terms  are  unifiable  though  E*  is  confined  and 
regular 

The  impo^ant  results  of  this  section  are  Theorems  I  and  II.  Theorem  I  will  be  used  to  show  the 
completeness  of  case  J  of  C*<  unit,-  where  the  empty  set  of  unifiers  is  returned  for  two  non-variable 
terms  with  heads  constrained  by  different  sub-theories.  Theorem  II  will  be  used  to  show  the  com¬ 
pleteness  of  case  C  of  CR  \.anabie-unify,  where  the  empty  set  of  unifiers  is  returned  for  one  variable 
and  one  non  variable  term  when  the  variable  occurs  in  the  non  variable  term  below  its  top 
homogeneous  part. 


3.2  Consistency 

This  section  presents  the  proof  of  consistency  for  CR-unify.  The  key  lemma  for  consistency, 
Lemma  7,  can  be  explained  informally  as  showing  that  any  substitution  which  unifies  a  sub-theory 
unifier,  p.  with  the  preserving  substitution,  y,  will  also  be  a  unifier  of  the  two  terms,  1  and  s  (where  the 
notation  here  is  that  of  the  CR-unily  procedure).  This  lemma  alone  is  not  enough  to  show  consis¬ 
tency,  since  the  consistency  of  CR-unify  depends  on  the  consistency  of  map  unify ,  which  in  turn 
depends  on  the  consistency  of  CR-unify.  Therefore,  we  will  use  an  induction  on  the  depth  of  recur¬ 
sion  for  proving  the  consistency  of  CR-unify  in  Theorem  III. 

Lemma  7:  Let  F.  be  the  set  of  constrained  symbols  for  and  let  t  and  s  be  homogeneous  terms  in 

F.  If  y,  p,  and  a  are  substitutions  then: 

pf  =  ps  &  cr°p  =a°y  =>  a(yf)  =a(ys). 

E.  E  E 
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Proof. 


pt  =  ps  =>  pf  |  ps,  since  E.  is  a  sub-theory  of  E 

=>  a(pt)  =  a(ps),  by  applying  the  substitution,  a 
=>  (cr°p)t  |  ( a°p)s ,  from  the  definition  of  composition 
=>  (<t°y)/  =  (<r°y)s,  substituting  o°y  for  o°p 
=>  a(yt)  |  a{ys) 

□ 

The  consistency  theorem  is  given  below.  The  proof  is  done  on  a  case-by-case  basis,  where  the 
cases  are  those  appearing  with  labels  in  CR-unily  algorithm.  The  basis  cases  are  those  in  which  no 
recursion  is  done  and  the  inductive  steps  are  those  that  involve  recursion.  Theorem  III  uses  Lemma 
7  in  proving  the  inductive  steps. 

Theorem  III:  If  a  is  a  substitution  produced  by  CR-unify(t,  s ),  then  a  is  an  E-unifier  of  t  and  s. 

Proof.  By  induction  on  the  level  of  recursion  of  CR-unify. 

1.  Basis:  Cases  1  and  4  of  the  CR-unify  procedure  and  cases  A  and  C  of  CR-variable-unify,  as 
called  in  cases  2  and  3  of  the  CR-unily  procedure : 

A.  Case  1 .  t  and  s  are  both  variables,  so  {/  «■  s}  is  a  unifier. 

B.  Case  4:  Returns  the  empty  set,  so  the  theorem  holds  vacuously. 

C.  Case  A:  v  does  not  occur  in  s,  so  the  single  returned  substitution,  {  v  *■  s),  is  a  unifier. 

D.  Case  C:  Returns  the  empty  set,  so  the  theorem  holds  vacuously. 

2.  Inductive  step:  Case  5  of  CR-unify  and  case  B  of  CR-variable-unify  are  the  inductive  steps. 

A.  Case  5:  f  and  s  are  non  variable  terms,  f  and  s  are  their  respective  homogeneous  forms, 
and  y  is  a  preserving  substitution  for  both  t,  t  and  s,  S. 

By  consistency  of  each  of  the  E .-unify  procedures,  each  pCP  is  a  unifier  of  f  and  s. 

Choose  a  value  for  p. 

Each  <t€Z  is  generated  by  map-unify(y,  p)  for  some  p,  and  is  therefore  of  the  form 
wn°...°w1,  where  ot>.€CR-unify(u.  ^v.,  «. ^yv.).  By  the  induction  hypothesis,  each 
value  of  wj  is  a  unifier  of  <*>. ^pv.  and  ^yvj,  so  a  is  an  E-unifier  of  each  correspond- 
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ing  pair  of  terms  in  the  range  of  p  and  y  and  is  therefore  a  unifier  of  p  and  y 
themselves. 

y  p  and  a.  meet  the  conditions  of  Lemma  7  with  f  and  s  for  t  and  s  in  the  lemma  and  t 
and  s  for  y t  and  ys  in  the  lemma,  so  0  is  a  unifier  of  t  and  s. 

Therefore.  off|f)UV(s)  is  also  a  unifier  of  t  and  s. 

B.  Case  9  >  is  a  variable,  s  is  non  variable  term,  s  is  the  homogeneous  form  of  s  and  y  is  a 

preserving  substitution  for  s.  s. 

By  the  assumption  of  consistency  of  the  E .-unify  procedures,  each  p€P  is  a  unifier  of  v 
and  s. 

By  the  same  argument  as  in  case  5.  each  o€2  is  an  E-unifier  of  v  and  s  in 
CR  variable  unify,  which  is  exactly  the  pair  of  terms  t,  s  or  s,  t  of  CR-unify. 

Therefore.  °l{\  }Ufls)  is  a  unifier  of  t  and  s  of  CR -unify. 

□ 

3.3  Completeness 

In  this  section  we  will  prove  that  CR  umly  is  complete  for  all  confined  regular  theories.  The  proof 
of  the  completeness  theorem.  Theorem  IV,  is  done  by  induction  on  the  depth  of  recursion,  and  its 
structure  is  similar  to  the  proof  of  consistency,  Theorem  III.  The  proof  of  Theorem  IV  is  given  in 
Section  3  3  3,  and  uses  a  number  of  lemmas  developed  in  Sections  3  3.1  and  3  3.2.  The  main  lemmas 
are  Lemmas  12  and  13  in  Section  3.3.2.  the  proofs  of  which  rely  on  the  technical  definitions  and 
lemmas  of  Section  3.3.1.  The  proof  of  Theorem  IV  will  also  use  Theorems  I  and  II,  since  each  gives  an 
independent  set  of  sufficient  conditions  for  completeness  of  a  failure  case.  Section  3.3.1  gives  some 
technical  definitions  of  functions  that  are  used  in  the  proof  of  completeness  but  are  not  needed  in  the 
implementation 

3.3.1  A  New  Homogenizing  Operation 

Our  proof  of  completeness  uses  a  function,  U-Homog,  for  forming  homogeneous  terms.  The 
homogenizing  operation  in  Section  2.1. 2.1  is  not  unique  for  a  given  input,  but  may  vary  in  the  names 
of  new  variables.  The  function  defined  here  will  use  a  special  set  of  variables  and  have  an  inverse 
mapping  which  is  a  substitution;  both  the  variable  set  and  the  inverse  substitution  are  universally 


defined  for  E*.  In  addition,  we  will  extend  U-Homog  to  a  function  on  substitutions.  The  functions 
defined  here  are  used  solely  as  aids  in  the  proof  of  completeness  and  are  used  in  the  implementation 
of  CR  unify. 

The  special  set  of  variables  used  in  forming  homogeneous  terms  will  be  denoted  by  U.  Each 
variable  in  U  represents  an  equivalence  class  of  terms  in  the  theory,  E*.  There  is  one  for  each 
element  of  quotient  algebra,  T{F,  V-U)/=.  We  will  represent  each  variable  in  U  as  where  f  is 
some  term  containing  no  variables  in  U,  and  [/]  represents  the  equivalence  class  of  which  /  is  a 
member.  By  definition,  we  know  t  =s  =>  =  u  jsj,  i.e.,  u  ^  and  u ^  are  two  denotations  for  the  same 

variable.  Henceforth,  we  will  assume  the  existence  of  this  set,  U,  as  it  is  universally  defined  for  E*, 
and  denote  V-U,  the  complement  of  U,  by  ->U. 

3.3.1 .1  Homogeneity  Using  U 

The  function,  U-Homog,  is  similar  to  Homog  except  that  each  maximal  subterm  whose  head  is  not 
in  the  set  F  will  be  replaced  with  an  element  of  U  rather  than  with  an  arbitrary  new  variable.  The 
following  definition  of  U-Homog  differs  from  the  definition  of  Homog  only  in  case  3. 

Definition.  Let  F  be  a  set  of  function  symbols  and  f  be  a  term  containing  no  variables  in  U. 
U-Homog[t ,  F)  is  defined  as  follows: 

1 .  If  f  is  a  variable,  then  U-Homog[1,  F)  =  f. 

2.  If  f  =  /  (fv...,fn)  and  /  €  F,  then 

U-Homog{t,  F)  =  f(U-Homog(ty  F),...,U-Homog[tn,  F)). 

3.  Iff  =  /(f1,..,fn)i/  C  F  then  U-Homog(t,  F)  = 

We  extend  the  notion  of  homogeneity  and  the  homogenizing  function  to  substitutions.  A  substitu¬ 
tion,  a,  is  homogeneous  with  respect  to  a  set  of  function  symbols,  F,  if  and  only  if  U  T(f)CF.  We 

td%(o) 

define  a  function  U-HomogMap  on  substitutions,  which  is  analogous  to  U-Homog  on  terms. 

Definition.  Let  a  be  a  substitution  containing  no  variables  from  U  (i.e.,  [9(<x)(JJ(a)]nu  =  0)  and  let 
F  be  a  set  of  function  symbols.  Then  U-HomogMap(o,  F)  is  a  substitution  such  that 
^(U-HomogMaph r,  F))  C  5(<j)UU  and  for  all  v  €  9(a)UU: 

1.  If  v€9(or),  then 

U-HomogMap(o,  F)v  =  U-Homog(ov,F). 

2.  If  v€U,  then  by  the  definition  of  U,  v  =  for  some  t  (where  tftjnu  =  0),  and 
U-HomogMap(o,F)v  =  U[ofj. 
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U-HomogMap  is  well-defined  because  t  =s  =>  at  =as,  so  picking  an  arbitrary  t  from  [r]  will  result  in  a 
unique  equivalence  class,  [at],  and  thus  a  unique  variable,  u  The  domain  of  substitutions  formed 
by  U-HomogMap  may  be  infinite  unlike  other  substitutions  we  have  used  thus  far.  Extending  substitu¬ 
tions  in  this  manner  gives  no  additional  most  general  unifiers,  because  any  term  has  only  a  finite 
number  of  variables.  When  the  value  of  F  is  clear  from  context,  we  will  use  7  and  o  to  denote 
U-Homog(t ,  F)  and  U-HomogMap{o,  F),  respectively. 

As  noted,  the  definition  of  T  and  7  are  identical  except  in  the  names  of  variables  used  to  replace 
subterms.  Furthermore,  if  F  is  [ t.head ]  as  in  f,  7  is  an  instance  of  f  for  any  f,  since  f  uses  different 
variables  for  each  replaced  subterm  whereas  7  will  use  the  same  variable  more  than  once  if  twc 
replaced  subterms  are  equal  modulo  E.  Therefore,  we  can  relate  f  to  7by  finding  the  match  of  7 by  t. 
Furthermore,  observe  that  this  is  the  homogeneous  form  (with  respect  to  F)  of  the  preserving  substitu¬ 
tion  for  t  and  f. 

Proposition  1:  If  F  =  [t.head]  and  y  =  Preserve^,  f),  then 

yf=*  7. 

The  definition  of  U-Homog  may,  at  this  point,  seem  somewhat  under-motivated.  The  following 
lemmas  describe  some  useful  properties  of  U-Homog  that  will  be  used  in  the  completeness  lemmas 
and  are  not  valid  for  Homog.  The  two  functions,  Homog  of  the  implementation  and  U-Homog  of  the 
lemmas,  are  related  in  the  completeness  theorem  using  Proposition  1.  Lemma  8  shows  that  U-Homog 
commutes  with  the  application  of  substitutions  to  terms,  and  Lemma  9  extends  this  property  to  com¬ 
position  of  two  substitutions.  Lemma  10  is  the  key  result  of  these  lemmas;  it  justifies  the  division  of 
the  unification  problem  in  E*  into  unification  problems  in  the  sub-theories  by  showing  that  the  exist¬ 
ence  of  a  particular  equation  in  E*  implies  the  existence  of  the  homogeneous  form  of  the  same 
equation  in  the  sub-theories.  Henceforth,  we  will  assume  that  no  substitution  or  term  contains  a 
variable  from  U,  unless  formed  from  U-Homog  or  U-HomogMap. 

Lemma  8:  For  any  set  of  function  symbols,  F; 

a  1=  at. 

Proof.  By  induction  on  the  structure  of  t. 

1 .  Basis:  t  is  a  variable,  call  it  v . 

A.  If  v  €9(a)  then 

a  7  =  o  v  =  5v 


=  av ,  by  the  definition  of  U-HomogMap. 


B  If  ••  fJ(o)  then: 


a  f  =  o  v  =  or 

=  i .  since  i  £cJ(o)  and  i  CU  =>  i  C3(o) 

=  Of  =  0>’ 

2.  Inductive  step,  f  =  /(f, . fn).  for  some /CF  of  arity  n  i.  0. 

A.  If/ CF,  then: 

oT  =  o/(f, . fn) 

=  o/(7 . 7n).  because/is  in  F 

=  /(of . of),  by  homomorphism  of  substitutions 

=  Mo7, . o7n).  by  the  induction  hypothesis 

=  ./(of, . o/n).  from  the  definition  of  U-Homog 

=  o/(f, . fn),  by  homomorphism  of  substitutions 

=  of 

B.  If/  CF.  then: 
o7  =  o/(f,,...fn) 

=  ow  because  /  is  not  in  F 
=  «M.  since  u(f]€U 

=  o7,  since  the  head  symbol  of  of  is  still  /,  and  therefore  not  in  F 

□ 

Lemma  9  extends  the  commutativity  of  U-HomogMap  with  substitution  application  to  the  com¬ 
mutativity  of  U-HomogMap  with  substitution  composition.  The  proof  is  straightforward. 

Lemma  9:  If  F  is  any  set  of  function  symbols,  then: 

-  5i°5r 

Proof.  Show  V  v€  V  o,°o2  v  =  0,oo2v.  There  are  two  cases: 

1 .  If  v  €  U ,  then  v  =  u  ^  for  some  term  f ,  then : 

W(,l  *  V/Of]  =  UloAoJ))  =  ®1M[o„f]  =  51<52w[f]> 


2.  If  v€U: 


=  o, 


“a2MW 


o^CTjV  =  =  o,°a2v  = 

=  a,(oJv)  =  O^OjV)  =  CT^ffjV 


=  w 


Lemma  10  is  the  key  to  our  completeness  argument.  Unlike  Lemmas  8  and  9,  which  do  not 
assume  any  relationship  between  F  and  it,  Lemma  10  will  take  F  to  be  a  set  of  function  symbols 
constrained  by  one  of  the  sub  theories.  It  states  that  the  homogeneous  forms  (by  U-Homog)  of  any 
two  E-equal  terms  are  Ej-equal  in  the  sub- theory  constraining  F.  It  is  important  that  F  is  the  set  of 
constrained  symbols  for  the  sub-theory,  E*,  but  we  do  not  make  any  assumptions  about  head  symbols 
of  t  and  s  belonging  to  F. 


Lemma  10:  If  F  is  the  set  of  constrained  symbols  for  some  sub-theory,  E.*  of  E*,  and  E*  is  confined, 
then: 

t=s  =>7=s. 

E  Ei 

Proof.  By  induction  on  the  length  of  proof  of  t  =s,  starting  from  the  axioms  in  tr. 

1 .  Basis:  If  t  =s  is  an  axiom  in  some  E.€w,  then: 

E  i 

A.  If  E  =  Ej,  i.e.,  f  =s  €  Ejf  then  t  and  s  must  be  homogeneous  in  F.  Therefore,  7=f,  s  =  s, 
and  7  =  s  €  E(,  so  there  is  a  proof  in  E,. 


B.  If  E*E.,  i.e.,  f  |$  $  E(  then  assume,  since  E*  is  confined,  that  both  t  and  s  are  non¬ 
variables.  Furthermore,  by  disjointness  of  function  symbols  in  elements  of  it  we  know 
P(f)UT(s)]nF  =  0.  Thus,  7=  and  s  =  w ^  and  since  t  |S,  it  follows  from  the  defini¬ 
tion  of  U-Homog  that  u  ^  and  u  ^  are  identical  and  thus  u  M  =  u  ltl  by  reflexivity. 


M  Ej  1*1 

2.  Inductive  step:  Consider  each  possible  inference  rule  for  the  last  step  in  the  proof. 
A.  Reflexive,  symmetric  and  transitive  rules:  Obvious. 


B.  Equality:  From  f.  =sjt  1  i  i  i  n,  deduce  f(ty...,tn)  =/'(s1 . sn),  where  t  is  f(ty...,tn)  and  s  is 

/(s1 . sn).  By  the  induction  hypothesis,  f“  |  s.  for  1  S  i  <  n.  There  are  two  cases  on / : 

i)  If  /  €F,  then  T-f  (71,...,7n)  and  s  =/ (s1,...,sn),  so  7=  s  is  implied  by  the  equality  rule. 

ii)  If  /  ( F,  then  7=  u  ^  and  s  =  u  jsj.  Since  t  *s,  u  ^  and  u  |sj  are  identical,  and  thus  7=  s 
by  reflexivity. 


X  . 


C.  Instantiation:  From  t'  =s'  deduce  at'  =as',  where  t  is  at1  and  s  is  as.  By  the  inductive 
E  E 

hypothesis  we  know  7  =  s’  and  applying  the  substitution  a  to  this  equation  we  get 
oT  =  os’,  which  by  by  Lemma  8  implies  of  =  os'.  But,  at'  is  /,  and  as'  is  s,  so  this  gives 

us:  E| 

E, 

□ 

Note  that  the  assumption  that  E*  is  confined  is  necessary.  Without  this  assumption,  case  B  in  the 
basis  case  would  not  hold.  Consider  an  equation,  for  example,  v  =t,  where  t.headtF.  in  this  case, 
v  =  v  and  7=  u  but  v  and  u  ^  are  different  variables  and  are  therefore  not  equal  in  E*. 

While  intuitively  Lemma  10  is  the  key  to  the  completeness  argument,  we  must  still  relate  the 
equality  problem  back  to  the  unification  problem  and  relate  the  U-Homog  forms  of  terms  in  the  lemma 
to  the  Homog  forms  in  the  algorithm. 

3.3.1 .2  The  Inverse  Substitution 

For  the  homogenizing  function  Homog,  we  were  able  to  define  the  notion  of  a  preserving  substitu¬ 
tion,  Preserve (f ,  f),  which  mapped  a  homogeneous  form  of  a  term  back  to  the  original  term.  For 
U-Homog,  such  a  substitution  cannot  be  defined,  since  two  subterms  may  be  different  terms,  but 
equal  modulo  E,  and  will  thus  be  replaced  by  the  same  variable.  We  define  instead  the  universal 
E-preserving  substitution,  ft,  which  maps  each  variable  in  U  to  some  element  of  the  equivalence  class 
of  terms  it  represents. 

Definition.  Let  n  be  a  substitution  such  that  9(jx)  =  U  and  Vu  ^€11,  fiu  ^  =  t\  such  that  t'  |f. 

The  choice  of  which  term  in  the  equivalence  class  to  use  is  not  important  but  only  serves  to  take 
us  from  an  element  of  U  back  to  the  set  of  terms  in  which  we  are  working,  terms  that  do  not  contains 
variables  from  U.  The  axiom  of  choice  guarantees  the  existence  of  such  a  substitution.  By  construc¬ 
tion  of  n  and  the  homomorphism  of  substitutions,  we  now  have  the  following  propeVty: 

Proposition  2:  For  any  term,  t,  and  any  set  of  function  symbols,  F: 

fi(T)|f. 

We  would  like  to  extend  this  property  directly  to  substitutions  and  get  no  =o,  but  because  the 
domain  of  p  contains  all  variables  in  U,  composing  p  with  the  homogeneous  form  of  a  substitution 
yields  a  substitution  with  more  variables  in  its  domain  than  the  original  substitution.  Therefore,  the 
extension  to  substitutions.^  the  following  lemma: 
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The  property  proved  in  Lemma  11  is  expressed  as  a  diagram  in  Figure  3-3.  This  diagram  will 
appear  as  a  basic  component  in  Figure  3-4,  which  illustrates  the  main  completeness  lemma,  Lemma 
13  Figure  3-4  will  bring  us  very  close  to  the  preliminary  correctness  diagram  of  Figure  3- 1 . 


Figure  3-3:  Commuting  Diagram  for  the  Universal  E-preserving  Substitution 


3.3.2  The  Completeness  Lemmas 


Having  built  up  some  background  lemmas  to  use  in  proofs,  we  are  now  ready  to  prove  the  main 
results  leading  to  completeness.  The  first  lemma,  Lemma  12  shows  that  the  decomposition  process  is 
complete,  that  every  E-unifier  is  made  up  only  of  pieces  of  E.-unifiers.  The  second  lemma,  Lemma  13, 
shows  that  the  combination  process  of  unifying  substitutions  is  complete,  that  all  the  necessary 
combinations  of  Ej-unifier  pieces  are  considered. 

In  the  following  lemmas,  assume  t  and  s  are  terms  with  compatible  head  symbols,  i.e.,  either  one  is 
a  variable  and  the  other  is  not,  or  they  are  both  non-variables  with  roots  constrained  by  the  same 
sub-theory.  Let  y  be  the  combined  preserving  substitution  for  t  and  s,  i.e.,  yf=f,  ys  =  s  and  9(y)  C 
T(f)UT{s).  E t  unify  will  be  the  sub-theory  unification  algorithm  for  the  sub-theory  constraining  the 
head  symbols  of  t  or  s  (one  or  both  depending  on  whether  they  are  both  non-variables  or  not). 

Lemma  12: 

If  E*  is  confined  then: 

at  =as  =>  a°yf  =  a®y3 
E  E. 

i 

Proof. 


ot  a  os  =>  crt  =  os,  by  Lemma  10 
E  Ei 

=*>  a  t  =  as,  by  Lemma  8 
Ei 

=>  ayf  =  ays,  by  Proposition  1 
=>  a°y  f  =  a°y  s,  by  Lemma  9 

□ 

This  proves  the  existence  of  an  E.-unifier  of  the  homogenized  terms  for  any  E  unifier  of  the 
unhomogenized  terms,  and  just  as  important,  gives  a  way  of  constructing  the  E^-unifier  from  the 
E  unifier.  This  is  the  key  to  showing  that  our  approach  of  dividing  E-unification  problems  into  several 
E(  unification  problems  is  complete  We  still  need  to  show  that  the  manner  in  which  E-unifiers  are 
constructed  from  the  unifiers  in  the  algorithm  is  complete. 

Recall  that  in  CR -unify  the  E -  unifiers  are  combined  through  unification  of  substitutions  with  the 
preserving  substitution  One  property  sufficient  to  show  completeness  would  be  that  any  E- unifier 
unifies  the  preserving  substitution  and  the  constructed  Ej- unifier.  However,  this  property  does  not 
hold  in  general  for  the  following  reason:  some  of  the  variables  in  the  range  of  the  sub-theory  unifier 


may  be  elements  of  U,  whereas  none  of  the  variables  in  the  range  of  y  can  be  in  U,  and  furthermore,  a 
does  not  contain  any  variables  from  U  in  its  domain,  so  a  will  not  unify  the  two  substitutions. 

Instead  of  the  property  in  Figure  3- 1 ,  we  show  a  weaker  but  still  sufficient  result  which  states  that 
a  is  part  of  a  substitution  that  unifies  p  and  y.  The  substitution  fi  will  solve  the  problem  of  the 
variables  in  3(p)  being  in  U  since  it  maps  every  variable  in  U  to  some  term  containing  no  variables  in 

U. 

Lemma  13  shows  that  the  substitution  a°p  is  an  E-unifier  of  y  and  the  constructed  p;  and  Figure 
3-4  gives  the  pictorial  representation  of  the  proof.  The  proof  of  Lemma  13  starts  from  the  result  in 
Lemma  1 1 ,  just  as  the  left-hand  diagram  of  Figure  3-4  shows  two  instances  of  Figure  1 1 .  Similarly,  the 
right-hand  diagram  represents  the  last  step  in  the  proof  of  Lemma  13,  namely  the  statement  of  the 
Lemma.  Note  that  the  right-hand  diagram  of  Figure  3-4  is  almost  identical  to  the  diagrams  in  Figure 
3-1,  our  original  goal  for  a  correctness  diagram;  the  only  difference  is  that  the  a  arrows  in  Figure 
3-1  are  a°p  arrows  in  Figure  3-4. 


■=>  a°/i°(o°Y)  |  o°o°y°i i.  by  applying  a 

=*>ocp°(o°y)  =u°Y0/x.  by  idempotence 

=»  a°/i°(o0 y)  |U  a°Y°(i-  since  U  C  V 

=>  o°p°(o°y)  |U  a°y.  Since  cl(p)  =  U 

=>  o°n°(o°y)  =U  a°/i°Y.  since  3(y)HU  =  0  and  3(fi)D~iU  =  0 

=>  o°p°p  |U  a°p°y.  since  0(a°Y)  t-I(p)CU 

=>  o°p°p  =  a°jx°Y.  since  (fi(Y)UrJ(p))nu  =  0 

□ 

3.3.3  A  Proof  of  Completeness 

This  section  proves  the  completeness  theorem  for  CR-unify.  The  proof  works  in  the  reverse 
direction  from  the  CR-unify  algorithm,  showing  that  any  "good"  answer  was  made  from  pieces  that 
must  be  considered  by  CR-umfy.  We  assume  the  input  terms  have  a  unifier,  and  by  observing  some 
properties  of  the  unifier,  show  that  either  it  or  a  more  general  unifier  will  be  produced  by  CR-unify. 
The  proof  may  also  be  somewhat  confusing  because  the  result  seems  too  weak.  At  every  step  we  are 
showing  the  there  is  some  partially  formed  unifier  that  is  more  general  than  any  actual  unifier,  but  we 
never  show  that  the  pieces  form  a  unifier.  Recall,  however,  that  this  condition  was  proved  separately 
in  the  proof  of  consistency,  Theorem  III. 

One  of  the  non  obvious  steps  in  the  proof  is  that  the  minimal  set  of  E^unifiers  is  sufficiently  large 
to  find  a  complete  set  of  E-unifiers.  Note  that  this  does  not  follow  directly  from  the  Lemma  in  Section 
3  3. 2,  since  they  only  show  that  the  E-unifier  can  be  constructed  (through  substitution  unification) 
from  some  E^unifier  and  not  that  it  can  be  constructed  from  a  minimal  E.-unifier.  This  is  important  to 
the  completeness  argument  since  the  E^unification  algorithms  are  only  assumed  to  return  minimal 
complete  sets  of  unifiers. 

Theorem  IV:  If  a  is  an  E-unifier  of  t  and  s,  then  there  exists  8  €  CR-unify(t.  s)  such  that  8  ^  a. 

E 

Proof.  By  induction  on  the  depth  of  recursion  in  CR-unify.  Proving  the  induction  step  will  require  a 
second  induction  on  the  number  of  calls  to  CR-unify  made  from  map-unify. 

1.  The  basis  for  induction  on  the  depth  of  recursion  are  those  cases  in  CR-unify  for  which 
map-unify  is  not  called,  namely,  cases  1  and  4  of  CR-unify  and  case  A  and  C  of 
CR-variable-unify  under  either  2  or  3  of  CR-unify. 


A  Case  1 :  t  and  s  are  both  variables.  Any  unifier  of  f  and  s  is  an  instance  of  [t*-s). 

B.  Case  A:  v  is  a  variable  not  occurring  in  s.  Any  unifier  is  an  instance  of  {v*-s}. 

C.  Case  C:  v  is  a  variable  occurring  in  s  and  in  5(y).  Therefore,  v  is  a  variable  in  some 
non  variable  proper  significant  subterm  of  s,  and  by  Theorem  II,  v  and  s  are  not  unifi- 
able.  The  empty  set  is  a  complete  set  of  unifiers. 

D  Case  4:  t  and  s  are  non- variable  terms  such  that  t.head*s.head.  By  the  Theorem  I,  t  and 

V 

s  are  not  unifiable.  Again,  the  empty  set  is  complete. 

2.  The  inductive  step  includes  cases  5  of  CR-unily  and  case  B  of  CR-variable-unify  when 
called  from  either  2  or  3  of  CR-unily. 

A  Case  5:  t  and  s  are  non-variables  such  that  t.head  =s .head.  Let  E.*  be  the  sub-theory  of 

ir  1 

E*  constraining  t.head  and  s.head  and  let  F  be  the  set  of  function  symbols  constrained 
by  E*. 

We  will  first  show  that  the  substitution  p  constructed  in  Lemma  1 2  bears  a  useful  relation 
to  some  sub-theory  unifier  found  in  CR-unily. 


Letp  =  a°Ylg(<FoY)- 


From  Lemma  12  we  know: 


pf  »  p§. 


By  the  completeness  of  E.-unify,  there  exists  p'  €E .-unify(t  s)  such  that 

P'  b  P,forv1=nnuns). 

Ei 

Therefore, 3<p  such  that  <p°p'  =ip  and  since  E.*  is  a  sub-theory  of  E*, 

(pop'^lp. 

Without  loss  of  generality,  assume 

g(<p)C9pUJ(p'). 

By  protectiveness  of  <p'  we  know 

(g(p)U3(p)ug(p')U5(p'))n(vrv2)  =  0. 

Let  V  =  ‘HOUTJs)  and  V,  =  \rUV_  and  from  the  previous  step  we  have 

2  3  1* 

y 

(pop'  |3p. 

Using  these  values  of  p,  p',  and  <p,  we  can  show  the  the  inductive  step  for  case  5,  i.e., 
that  there  exists  6Emap-unily(p\  y),  such  that  6  £2o.  By  examination  of  map-unity 


y- 


•  ' 


at.  a 

u 

r„.\  ■ 


•  ■  -  V 
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we  know  that  6  would  have  to  be  a  substitution  of  the  form  wn°...°w0,  where  n  is  the 
size  of  ^(p')U3(y)  and  u.  is  the  unifier  of  the  ith  pair  of  terms.  Let  a.  be  the  accumu¬ 
lated  unifier  for  the  ith  iteration,  i.e.,  a.  =  ° ...°  <oQ.  Show  by  induction  on  i  that 

a  is2  a. 
n  E 

i)  Basis:  If  i  =  0,  then  a0  =  uQ  =  i.  Therefore,  <jq  a.  (Note:  n  =  0  would  cor¬ 
respond  to  both  t  and  s  being  homogeneous  and  equal  modulo  E.) 

ii)  Inductive  step:  If  i>0,  then  let  vf  be  the  ith  variable  considered  in  5(p')U3(y)  for 
forming  corresponding  pairs  of  terms. 

Show  there  exists  CR-unify(<j. ^pV,  a.  ^v.)  such  that  a.  =  u laaj  t  <2  o. 

v 

ffj |2  0,  by  the  induction  hypothesis  on  i 

a. M  |2  0°p,  since  9(p)n3(a)  =  0  &^{p)nV2  =  0 

3^  such  that  '/'°0j.1  =20°  ft,  by  definition  of  ^2 

a°n°p  =  a°p°y,  from  Lemma  13 
v 

a°fi°p  =3  a°fi°y 

,  v  ,  V 

0°ft °<p°p  =3  <r°p°y,  since  <p°p  =3  p  was  shown  above 

3tp'such  that  <p'°a°p°p '  =3 a°p°y,  since  Vx€9(<f>),  xi  3(<p')  and: 

a)  If  x  €V2,  then  x  €9{0): 

p°<p°p'x  =  p°<px 
=  p°ax 


b)  lfjc€VrV2,thenx€9(Y): 
xiS(p)  &  jrt3(0). 

c)  If  JtCVgV^Vg,  then  x  is  a  new  variable  from  p': 

x$%o). 


Using  this  new  substitution,  <p',  we  have: 

<p'°<p'°a°p°p'  |3(p'a0ft0y, 

•  ,  V 

<p  °o°p°p  =3<jp  °0°poY, 

<P  °'l/0o.i.10p'  |3  M0y,  by  replacing  o°p  with  \ p<>oj 
Therefore,  is  an  E-unifier  of  tj  =  o,.1p'vj  and  sj  =  <j,  ^v.. 


By  the  induction  hypothesis  on  t  and  s,  there  exists  a  substitution,  wj  € 


CR-unify{t.s).  such  that: 

s4  where  V4  =  VtrjUY^). 

Since  3(qr ')  contains  variables  from  V1  only  if  they  are  in  3(o), 

v, 

<p  °a  =  1<r. 

E 

v 

Furthermore,  «  °o  |’  tp°if°oir  since  by  protectiveness  of  CR-unity, 

.v  i(a)  only  if  x  €V4. 

We  also  have  <r  °i P°o  <  =  ia>°ff=i(j=iffou. 

11  E  EE 

V 

Therefore.  <r  =  w(°o|1  |!  a.  which  which  completes  the  induction  proof  on  i. 
v 

The  induction  on  i  shows  «rn  s2a,  which  proves  the  inductive  step  on  /  and  s  in  case  5 

and  therefore  proves  the  existence  of  some  8  produced  by  CR-unily{t,  s)  such  that 
v 

8  ho 

E 

B  Case  B:  The  proof  follows  Case  5,  except  r  replaces  both  t  and  f. 


3.4  Proof  of  Termination 

If  recursive  calls  from  CR-unify  were  made  only  to  subterms  of  the  original  inputs,  then  termination 
would  be  obvious.  However,  at  each  iteration  within  map-unify,  the  substitution  accumulated  up  to  a 
given  point  is  applied  to  the  next  corresponding  pair  of  terms,  so  the  terms  of  a  recursive  call  are  not 
necessarily  subterms  of  the  original  inputs,  and  may  be  larger  than  the  inputs.  The  proof  of  termina¬ 
tion  in  Section  3.4  uses  noetherian  induction;  we  define  a  noetherian  ordering  on  terms  which  is 
proved  to  be  strictly  decreasing  with  each  level  of  recursion.  The  ordering  is  a  generalization  of  the 
ordering  used  by  Fages  to  show  termination  of  AC-unification  [Fages  84].  The  proof  of  termination  is 
a  generalization  of  the  termination  proof  for  AC-unification  [Fages  84],  We  consistently  extend  his 
definitions  to  handle  the  general  case  for  unification  in  equational  theories.  With  minor  exceptions, 
our  definitions  would  be  identical  to  Fages'  if  we  restricted  ours  to  only  the  associative-commutative 
and  empty  theories.  Noetherian  induction  is  discussed  briefly  in  Section  3.4.1.  Section 
3.4.2  describes  the  noetherian  ordering  that  is  the  basis  of  our  termination  proof  and  then  gives  some 
lemmas  on  the  ordering. 


3.4.1  Noetherian  Induction 

Classical  induction  is  based  on  a  total  ordering,  typically  the  "less  than"  ordering  on  the  natural 
numbers.  Noetherian  induction  is  more  general  in  that  it  is  based  on  a  partial  ordering;  the  partial 
ordering  is  additionally  required  to  have  no  infinite  decreasing  paths.  The  reader  is  referred  to  [Cohn 
65]  for  a  justification  of  noetherian  induction  and  to  [Huet  80b]  for  some  abstract  properties  of  order¬ 
ings. 

A  typical  class  of  noetherian  orderings  that  are  not  total  are  those  formed  as  lexicographic  exten¬ 
sions  of  two  or  more  total  orderings.  In  fact  any  lexicographic  extension  of  noetherian  orderings  is 
itself  a  noetherian  ordering.  We  will  use  this  fact  in  the  definition  of  the  noetherian  ordering  for  our 
termination  proof. 

3.4.2  A  Noetherian  Ordering  for  E-Unification 

The  input  to  CR-unify  is  a  pair  of  terms,  and  we  define  a  noetherian  ordering  on  pairs  of  terms  to 
perform  the  induction.  The  partial  ordering  on  terms  defined  by  the  subterm  property  is  not  accept¬ 
able  as  a  basis  for  our  induction  because,  as  mentioned,  recursive  calls  are  made  to  terms  formed  by 
applying  substitutions  to  subterms  of  the  inputs  and  not  just  to  subterms  of  the  original  input  terms. 
Therefore,  a  recursive  call  from  CR-unify  is  not  necessarily  made  to  arguments  that  are  strictly  less 
than  the  inputs  by  the  subterm  ordering.  We  instead  define  an  ordering  that  is  contained  in  the 
subterm  ordering,  but  does  not  increase  when  certain  substitutions  are  applied  to  inputs.  This  order¬ 
ing  is  shown  to  decrease  with  each  level  of  recursion.  It  is  a  lexicographic  extension  of  two  orderings 
on  the  size  of  sets  r  and  j>,  which  are  defined  below. 

Because  each  Ej-unification  procedure  is  assumed  to  terminate,  it  is  is  appropriate  for  the  current 
discussion  to  think  of  the  unification  of  two  homogeneous  terms  from  the  same  sub-theory  as  being  a 
single  computation.  Expanding  on  this  idea,  it  is  more  difficult  to  unify  a  term  with  many  in¬ 
homogeneous  subterms  than  one  that  is  close  to  being  homogenous,  even  if  the  more  homogeneous 
term  has  a  larger  actual  size.  This  should  help  motivate  the  definition  of  the  first  measure  of  com¬ 
plexity  based  on  the  set  of  distinct  terms  having  strict  and  significant  occurrences  in  the  input  terms. 

Definition,  t (t,s)  =  (r  |  r  is  non-variable  &  (r^f  or  r^s)}. 

The  intuition  behind  the  second  complexity  measure  is  more  difficult,  but  is  related  to  t  in  the 
following  sense:  If  a  variable  occurs  under  more  than  one  operator  set,  where  by  operator  set  we 
mean  an  equivalence  class  of  F,  then  any  substitution  of  a  non-variable  term  for  the  variables  will 
result  in  at  least  one  new  strict  significant  occurrence  in  the  resulting  term.  Therefore,  v  is  the  set  of 
variables  occurring  under  more  than  one  operator  set  in  t  and  s. 


Definition.  Let  v(t,s)  =  {.r£V  |  s/ze(ParSe/s(jr ,  /}  U  ParSets(x ,  s))  >  1} 

We  will  measure  the  complexity  of  a  particular  unification  problem  by  considering  the  cardinalities 
of  v  and  t,  and  use  this  measure  of  complexity  to  define  an  ordering  on  the  pairs  of  terms  comprising 
the  inputs.  The  ordering  will  be  denoted  —<£  and  is  defined  as  the  lexicographic  extension  of  the 
cardinalities  of  v  and  r: 

Definition.  <f',  s'>  -<£  <f,  s>  if  and  only  if: 

1 .  size(v{t\  s'))  £  size(r[t,  s)) 

2.  and,  ( size(v(t\  s'))  =  size(v{t,  s))  =>  size(r(t',  s'))  <  size(r(t,  s))). 

We  will  use  the  notation  to  denote  a  reflexive  ordering  containing  -<£,  i.e.,  <f',  s'>^c<f,  s>  if  und 
only  if: 

2.  or,  (size(v(t',  s'))  =  size(i>(t,  s))  &  size(r(t',  s'))  =  s/ze(T(f,  s))). 

Note  that  the  t  and  s  can  be  commuted  in  the  ordering,  since  r(t,  s)  =  t(s,  f)  and  v(t,  s)  =  v(s,  t). 
This  is  consistent  with  our  expectations  of  a  good  measure  for  the  complexity  of  unification  since 
unification  is  itself  commutative.  Lemma  14  states  that  -<g  is  a  noetherian  ordering.  This  will  allow  us 
to  use  it  as  the  ordering  for  an  inductive  proof. 

Lemma  1 4:  The  -<c  ordering  is  noetherian. 

Proof.  It  is  the  lexicographic  extension  of  two  instances  of  the  less  than  total  ordering  on  the  natural 
numbers.  E 

3.4.3  Some  Properties  of  the  Ordering 

The  inductive  hypothesis  within  the  proof  of  termination  can  only  be  applied  to  pairs  of  terms  that 
are  strictly  smaller  (in  this  case  by  the  -<c  ordering)  than  the  given  pair  of  terms.  We  will  use  the 
following  lemmas  to  show  that  recursive  calls  are  made  to  strictly  smaller  terms  than  the  input  terms. 
Lemmas  15  and  16  give  independent  conditions  that  are  each  sufficient  for  one  pair  of  terms  to  be 
less  than  another  by  the  ordering.  Lemma  15  shows  that  two  non-variable  proper  significant 
subterms  of  two  terms  have  strictly  smaller  complexity  than  the  two  terms:  it  will  apply  in  case  5  of 
CR-unify  where  the  algorithm  recurses  over  non-variable  arguments. 

Lemma  15:  Let  f',  and  s -be  non-variable  proper  significant  subterms  of  t  or  s,  then: 

<f',  s'>  -<£  <f,  s>. 


Proof.  Let  i'  ?  =  T(f.<).  t-2  =  v(t',  s')  and  t2  =  r(t',  s').  Since  t'  and  s’  are  subterms  of  f  or 

s.  all  vanubles  occurrences  in  /  or  s'  correspond  to  some  occurrence  in  f  or  s.  Since  t‘  and  s’  are 
non  variab'e  terms,  all  variables  in  t'  or  s'  will  occur  under  an  operator  they  occurred  under  in  t  or  s, 
so  i'2Cr,  and  s//e(<<2)  <s-zeU-}).  It  is  now  sufficient  to  show  which  in  this  case  is  independent 

of  whether  or  not  »’2  =  *'1- 

All  significant  subterms  of  /'  or  s'  are  significant  in  t  or  s  (because  t'  and  s'  are  themselves  significant 
in  f  or  s).  Therefore,  finding  one  significant  subterm  of  /  or  s  that  is  not  a  subterm  of  /'  or  s'  will  prove 
r2Crt  and  thus  size(r2)  <  stfe(T?).  Specifically,  we  will  show  that  either  l  is  not  a  subterm  of  t'  and  not 
a  subterm  of  s'  or  s  is  not  a  subterm  of  f’  and  not  a  subterms  of  s'  and,  each  being  a  significant 
subterm  of  itself,  satisfies  these  conditions  to  prove  size(r2)  <  size (r^. 

Assume,  without  loss  of  generality,  that  f  V/.  We  also  know  that  either  s  Vf  or  s  Vs: 

1.  If  sVf,  then  f  is  not  a  subterm  of  s'  and  t  is  not  a  subterm  of  f'. 

2.  If  s  Vs.  tnen  there  are  three  cases: 

A.  If  sV/'  then  s  Vf.  so  t  is  not  a  subterm  of  s'  and  t  is  not  a  subterm  of  f'. 

B.  If  f  Vs'  then  ?'-<s.  so  s  is  not  a  subterm  of  s'  and  s  is  not  a  subterms  of  /'. 

C.  If  t  Vs'  and  s'-//',  then  both  t  and  s  are  not  subterms  of  /'  or  of  s'. 

Therefore.  <f'.  s  >  -<£  <f.  s>.  □ 

Lemma  16  gives  sufficient  conditions  for  a  pair  of  subterms  of  two  terms  to  be  strictly  less  than  the 
two  terms  even  when  one  of  the  subterms  is  a  variable.  It  will  apply  in  case  B  of  CR  variable-unify 
where  the  algorithm  recurses  over  a  variable  and  a  non-variable  term. 

Lemma  16:  Let  s'  be  a  non-variable  proper  significant  subterm  of  t  or  s  and  let  x  be  a  variable  such 
that  size(ParSets(x ,  t)UParSets(x ,  s)}  >  1 ,  then: 

<) C,s'>-<£</,S>. 

Proof.  Let  i'1  =  »>(/, s),  t1  =  r(f,s),  i>2  =  v(x,  s'),  and  t2  =  t(x,s').  Since  s' is  a  subterm  of  f  or  s,  all 
variable  occurrences  in  s'  will  occur  under  the  same  operator  sets  in  either  t  or  s  and  since  x  will  be  in 
v,  its  new  occurrence  under  {«}  will  not  place  any  new  variables  in  v .  Therefore,  and 

size(v2)  isizeivj.  By  the  same  argument  as  in  Lemma  15,  t2Ctv  and  thus<» ,  s>  -<(■»</,  s>.  □ 

Lemmas  15  and  16  will  not  be  enough  to  show  that  the  complexity  decreases  with  recursion,  since 
map-uni fy  applies  the  accumulated  unifier  to  the  next  corresponding  pair  of  terms  before  calling 
CR-unify.  In  general,  applying  a  substitution  to  a  term  may  increase  its  complexity.  Therefore,  the 
following  set  of  definitions  and  lemmas  are  used  to  give  sufficient  conditions  on  a  substitution  and 
pair  of  terms  such  that  the  substitution  will  not  increase  the  complexity  of  the  pair  of  terms. 
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Each  substitution  created  directly  by  the  CR-unify  procedure  is  of  the  form  {x  r},  where  x  is  a 
variable  and  r  is  a  term.  A  substitution  of  this  form  has  a  domain  of  size  one  or  zero  and  will  be  called 
an  elementary  substitution.  The  following  definition  gives  sufficient  conditions  for  forming  elementary 
substitutions  for  a  pair  of  terms  to  assure  that  the  complexity  of  the  terms  will  not  increase  with 
application  of  the  substitution. 

Definition.  Let  <7  be  an  elementary  substitution,  a  is  said  to  be  elementary  non-increasing  for  t  and 
s  if  and  only  if  it  is  of  one  of  the  following  forms: 

1 .  {t «-  s},  where  t  is  a  variable  not  occurring  in  s,  or  similarly,  the  substitution  {s «-  f)  where  s  is 
a  variable  not  occurring  in  t. 

2.  {jt  «-  r},  where  x(f\r)  and  (ParSets(x ,  t)UParSets(x ,  s))ll(ParSets(r,  t)UParSets(r,  s))  *  0 

3.  { x  *■  y),  where  either  y  is  a  new  variable  or 

(ParSets(x ,  t)UParSets(x ,  s))f)(ParSets(y ,  t)UParSets{y ,  s))  0 

4.  {*  «-  r},  where  r  is  a  non-variable  homogeneous  term  such  that  x  $  f\r)  and 
[r.head]€ParSefs(jc ,  f)UParSefs(x ,  s)  and  V  y  €  T\r)  either  y  is  a  new  variable  or 
[r.head]€ParSets[y ,  t)\JParSets(y ,  s ) 

5.  {t  *■  r},  where  r  is  a  non-variable  homogeneous  term  such  that  fjr)  and 
[s.head]€ParSefs(f,  s)  and  r.head -s.head  and  VyCTfr)  either  y  is  a  new  variable  or 
[r./?ead]€ParSefs(>'1  t)UParSets{y,s).  (Similarly  for  t  and  s  reversed.) 

6.  {x  *■  r},  where  r  is  a  non-variable  homogeneous  term  such  that  x  $f(r)  and  there  exists  a 
significant  subterm,  r',  of  either  t  or  s  such  that  r.head  =  r  .head  and 

{ParSets{ x ,  t)UParSets{x ,  s))n(ParSefs(r',  f)UParSefs(r',  s))  *  0  and 
[r'.head]€ParSefs(x ,  r')  and  V  y  €T(r)  either  y  is  a  new  variable  or 
[r.head]€ParSe/s(y,  /)UParSefs(>',  s). 

Lemma  17  states  that  each  of  the  conditions  in  the  definition  of  elementary  non-increasing  sub¬ 
stitutions  is  sufficient  to  guarantee  that  application  of  such  a  substitution  will  not  cause  the  com¬ 
plexity  of  the  terms  to  increase  by  the  -<c  ordering. 

Lemma  17:  If  o  is  an  elementary  non-increasing  substitution  for  l  and  s,  then: 

<af,  s>. 

Proof.  Let  »>1  =  v(t,s),  t1  =  r(f,s),  i>2  =  v(ot,os),  and  t2  =  r(ot,os).  Consider  each  case  from  the 
definition  of  elementary  non-increasing  substitutions: 

1.  In  this  case  at  =  as  =  s  and  since  t  was  not  in  j^,  we  know  »'2  =  >'1  and  r2  =  r1  which  implies 
their  cardinalities  are  also  pairwise  equal. 


2.  Since  r  is  a  subterm  of  either  t  or  s,  *'2C^1.  If  *'2  =  *'1,  then  all  occurrences  of  x  must  have 
been  under  operators  from  a  single  sub  theory  (i.e.,  size(ParSets(x ,  /)UParSefs(x,  s))  =  1). 
Furthermore,  r  must  occur  under  an  operator  from  this  set  since  r  and  x  have  some  parent 
set  in  common.  Therefore,  the  new  occurrences  of  r  in  at  or  as  will  be  significant  if  and  only 
if  there  was  a  significant  occurrence  of  r  in  /  or  s  and,  thus,  r 2Qry 

3.  If  y  is  a  new  variable,  then  a  is  simply  a  variable  renaming,  so  size(v2)  =  size(v^).  Otherwise, 
there  is  a  common  parent  set  of  x  and  y,  so  size(v2)<size{vj.  In  either  case, 
size(r2)<size(r^). 

4.  Let  y€Y[r).  If  y  is  a  new  variable,  then  all  occurrences  of  y  in  at  or  as  are  in  rand,  since  r  is 
homogeneous,  size(ParSets(y ,  t)UParSets(y,  s))  =  1.  Otherwise,  there  is  a  common  parent 
set  of  y  and  x ,  so  y  will  occur  under  symbols  from  more  than  one  sub  theory  in  at  or  as 
only  if  either  x  or  y  did  in  t  or  s.  Therefore,  size(i>2)  s  size(v^).  If  size(i>2)  =  size(i>J,  then  x 
must  have  occurred  under  only  one  parent  set  and  this  set  is  [ r.head ].  Therefore,  any 
occurrences  of  r  in  at  or  as  will  not  be  significant  and  by  homogeneity  of  r,  none  of  its 
subterms  will  be  significant,  so  size{r2)  <  sizeirj. 

5.  t  occurs  under  {»}  and  [s.head],  so  size{ParSets(t,  t)LiParSets(t,  s))  >  1.  All  other  variables 
in  r  are  either  new,  and  by  homogeneity  of  r  have  only  one  parent  set,  or  already  occur 
under  [r.head]  in  f  or  s.  Therefore,  size{v2)  <  size(ej. 

6.  x  occurs  in  t  or  s  under  both  [r'.head]  and  the  set  of  symbols  distinct  from  [ r'.head )  under 
which  r'  occurs  to  make  it  significant.  Therefore,  x  occurs  under  more  than  one  set  of 
symbols  in  t  or  s  and  not  at  all  in  at.  All  other  variables  in  r  occur  under  multiple  parent  sets 
in  at  or  as  only  if  they  did  in  t  or  s  by  the  argument  in  case  (5),  so  v2  =  *»1— {or}. 

□ 

We  extend  the  definition  of  non  increasing  to  general  substitutions  as  well  as  elementary  ones  by 
considering  a  composition  of  elementary  non  increasing  substitutions. 

Definition.  If  cr  =  an°. ..“a,  and  a.  iselementary  non-increasing  fore. 1...a1f  and  <x.  t ...cr1s,  isn,  then 
a  is  said  to  be  non-increasing  for  t  and  s. 

Notice  that  the  definition  of  non -increasing  reflects  the  way  in  which  substitutions  are  built  in 
map-unity;  one  unifier  is  found  and  applied  to  the  inputs  before  next  unifier  is  found.  Lemma 
18  proves  the  desired  property  on  non-increasing  substitutions,  i.e.,  applying  a  non-increasing  sub¬ 
stitution  to  the  given  pair  of  terms  does  not  increase  their  complexity.  It  extends  the  property  in 
Lemma  17  to  substitutions  that  are  not  elementary. 
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Lemma  18:  If  a  is  a  non  increasing  substitution  for  f  and  s,  then: 

<al ,  os>^g<t,  s>. 

Proof.  By  induction  on  n  using  Lemma  17.  □ 

Lemma  1 9:  Let  t‘.  s'  be  subterms  of  f  or  s  such  that 

(ParSets{t '.  i)UParSets(f\  s))f!(ParSefs(s'.  t)UParSefs(s'.  s))  *  0.  If  a  is  non-increasing  for  t'  and  s', 
then  a  is  non  increasing  for  t  and  s. 

Proof.  The  proof  is  done  by  examining  the  elementary  factors  of  a  to  show  that  each  of  the  con¬ 
ditions  that  makes  it  non -increasing  for  t'  and  s'  corresponds  to  some  conditions  that  make  it  non¬ 
increasing  for  t  and  s. 

1.  {f’  «■  s’}  corresponds  to  form  (2)  for  t  and  s. 

2.  { x  «■  r}.  where  r  is  a  significant  subterm  of  t  or  s'  is  still  form  (2)  for  t  and  s  since 
ParSetsfx ,  OUParSefsO  .  s  )  C  ParSetsf  x ,  f)  U  ParSets(.x .  s)  and 
ParSe1s{r,  t')L)ParSets(r.  s')  C  ParSets(r,t)  U  ParSets(r,  s). 

3  { x  *■  y  }  for  /  and  s'  is  still  form  (3)  for  t  and  s 
ParSets(x ,  t')DParSets{  x ,  s')  C  ParSe1s(x ,  t)  U  ParSets(x ,  s)  and 
ParSetsO ■,  t')UParSets(\  ,s')C  ParSets{\  ,t)  U  ParSetsiv ,  s). 

4.  { x  «■  r).  where  r  is  a  homogeneous  term  is  also  form  (4)  for  f  and  s  since 
ParSetsfx,  t‘)\JParSels(x ,  s')  C  ParSets(x  ,t)  U  ParSets(x ,  s)  and 

Vy €Y[r)  if  [r.headJCfParSe/sO  -f'JUParSefsO’  .s')),  then 
[r.head]6(ParSe/sO ,t)UParSets(y,s)). 

5.  { t  *-  r),  where  r  is  homogeneous  in  [s.head]  is  form  (6)  for  t  and  s.  The  term  s'  acts  as  r  in 
form  (6)  of  the  definition. 

6.  {jc  r },  where  r  is  homogeneous  and  there  is  an  r  as  described,  is  still  form  (6).  since  all 

significant  subterms  of  t'  and  s'  (including  t'  and  s',  themselves)  are  significant  in  t  or  s. 

□ 

The  following  technical  definition  will  be  used  in  our  proof  of  termination  to  show  that  non-trivial 
recursive  calls  are  made  only  to  pairs  of  terms  that  are  significant  and  appear  under  a  common  set  of 
symbols  in  the  input  terms  with  accumulated  partial  unifiers  applied.  The  property  is  not  obvious  in 
the  proof  since  we  are  building  the  corresponding  pairs  of  terms  out  of  substitutions  p  and  y,  not 
directly  from  t  and  s. 

Definition.  Let  t  and  s  be  two  homogeneous  terms  in  F,  the  constrained  symbols  for  a  sub-theory  of 
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E*  and  let  p  be  a  substitution  that  is  homogeneous  in  F.  Furthermore,  assume  either  f  or  s  is  non¬ 
variable.  A  substitution,  o,  is  said  to  be  parent  preserving  for  homogeneous  f,  s,  and  p,  if  and  only  if: 

(a  €X  &  v  £2t(<j)U3(0))  =>  F£ParSets(ox ,  at)  U  ParSets(ax ,  as), 
where  X  =  TU)Urts)U3(p). 

Informally,  X  contains  the  variables  from  t  or  s  that  were  not  cut  off  though  homogenization,  the 
new  variables  from  homogenization,  and  any  new  variables  from  p.  The  intuition  is  that  if  one  of  the 
variables  occurs  in  the  range  variables  of  a,  then  it  must  occur  somewhere  in  at  or  as,  and,  further¬ 
more,  one  of  those  occurrences  must  be  under  F.  In  addition,  if  one  of  the  variables,  x,  occurs  in  the 
domain  of  0,  then  the  range  element  0*  must  occur  in  0f  or  as  under  F.  It  is  called  the  parent 
preserving  property  because  the  variables  in  X  all  occur  under  F  in  either  f,  s,  pf  or  ps,  and  we  want 
this  parent  relationship  to  be  preserved,  even  when  the  unifier,  a,  is  only  partially  formed. 

3.4.4  The  Proof 

Theorem  V  is  the  termination  theorem  for  the  CR-unify  procedure.  The  proof  is  by  noetherian 
induction  on  the  complexity  of  terms  in  the  calls  to  CR-unily.  The  induction  step  is  proved  using  a 
second  induction  on  the  number  of  calls  to  CR-unify  made  from  a  single  invocation  of  map-unify. 

The  proof  is  quite  long  and  involved.  Before  presenting  the  proof  in  its  entirety,  we  will  give  a 
short  outline  of  the  proof's  structure.  This  should  give  the  reader  a  feeling  for  the  purpose  of  each 
step  in  the  proof  and  also  act  as  a  check  list  for  the  things  that  have  been  proved  and  the  things  still 
left.  The  proof  is  by  induction  on  t  and  s  using  the  -<£  ordering.  The  proof  uses  a  stronger  induction 
hypothesis  than  the  property  of  termination  alone.  In  addition  to  termination,  we  prove  that  returned 
substitutions  are  non-increasing  for  t  and  s. 

Proof  Idea. 

1.  Basis:  Cases  1  and  4  of  CR-unify  and  cases  A  and  C  of  CR-variable-unily,  as  invoked  in  case 
2  or  3  of  CR-unify.  We  need  to  prove  that  each  step  terminates  and  that  the  returned 
substitution  is  non-increasing  for  t  and  s. 

2.  Inductive  step:  Case  5  of  CR-unify  and  case  B  of  CR-variable-unify  (as  called  by  case  2  or  3 
of  CR-unify)  are  the  inductive  cases,  since  these  two  cases  require  recursive  steps.  These 
cases  can  be  considered  together,  since  both  t  and  f  for  case  5  are  v  in  case  B.  In  this  step 
we  will  show  that  calls  to  map-unify  terminate.  For  simplicity,  we  treat  the  algorithms  as  if 
they  always  pick  some  element  of  a  returned  set  of  unifiers  rather  than  exhaustively  trying 
each  element  of  the  set.  Since  these  sets  of  unifiers  are  finite,  this  simplification  does  not 
affect  the  soundness  of  our  proof. 


We  will  perform  an  induction  on  the  variables  in  the  domain  of  y  and  p.  These  variables  are 
divided  into  two  sets  denoted  V1  and  V2,  where  V2  contains  variables  representing  non¬ 
variable  subterms  of  t  or  s  and  V(  contains  variables  from  the  original  t  or  s.  The  induction 
on  these  variables  is  divided  into  two  parts,  one  for  V1  and  one  for  V2. 

In  addition  to  proving  the  termination  of  each  step  and  the  non-increasing  nature  of  unifiers, 
we  will  show  that  <rj  is  parent  preserving  for  f,  s,  and  p.  The  proof  that  <r  is  paren.  preserv¬ 
ing,  will  depend  only  on  the  induction  on  i,  not  on  the  induction  on  t  and  s  since  in  the  base 
cases  p,  f,  and  3  are  not  defined. 

A.  This  step  is  for  variables  in  Vv 

i)  Basis:  i  =  0,  the  first  variable  in  the  Vr 

ii)  Inductive  step:  i>0,  the  rest  of  the  variables  in  Vr 

Show  that  this  step  terminates  and  any  substitution  formed  is  both  non-increasing 
for  t  and  s  and  parent  preserving  for  f,  3,  and  p. 

B.  This  step  is  for  variables  in  V2. 

i)  Basis:  Since  V1  and  V2  are  processed  in  order,  this  is  simply  the  last  case  in  Vr 

ii)  Inductive  step:  For  the  variables  in  V2 

Show  that  this  step  terminates  and,  again,  that  any  substitution  formed  is  both  non¬ 
increasing  for  t  and  s  and  parent  preserving  for  f,  3,  and  p.  We  break  this  step 
into  five  cases  depending  on  the  value  of  Sj,  the  term  formed  from  p,  and  f.,  the 
term  formed  from  y. 

Given  this  rough  outline,  we  now  give  the  termination  theorem  with  complete  proof. 

Theorem  V:  For  any  terms,  t  and  s,  CR-unify(l,  s)  terminates. 

Proof.  By  induction  on  t  and  s  using  the  -<£  ordering. 

1.  Basis:  Cases  1  and  4  of  CR-unify  and  cases  A  and  C  of  CR-variable-unify,  as  invoked  in  case 
2  or  3  of  CR-unify.  In  each  case  termination  is  obvious;  We  will  prove  that  a  returned 
substitution,  a,  is  non-increasing  for  t  and  s. 

A.  Ca°e  1:  <r  =  {f«-s)  is  non  increasing  because  it  is  either  the  identity  substitution  or  of 
form  (1)  in  the  definition  of  elementary  non -increasing. 

B.  Case  A:  a  =  {f«-s}  is  of  form  (1 )  in  the  definition  of  elementary  non -increasing. 


C  Case  C  The  empty  set  of  substitutions  is  returned,  so  the  properties  are  vacuously  true. 


D  Case  4  Again,  the  empty  set  of  substitutions  is  returned. 

2.  Inductive  step:  Case  5  of  CR-umty  and  case  B  of  CR-variable-unify  (as  called  by  case  2  or  3 
of  CR-um', )  are  the  inductive  cases.  Let  E*  be  the  sub  theory  of  interest  and  F  be  the  set  of 
function  symbols  constrained  by  E.\  i.e.,  F  =  [s.head], 

E^CR -unify  terminates  by  the  basic  assumption  on  sub  theory  unification  procedures. 

Map-unify  is  called  with  y.  the  preserving  substitution  for  t  and  s,  and  p,  an  Ej-unifier.  Induct 
over  i,  the  number  of  iterations  in  map-unify.  and  for  each  i  use  the  following  notation: 
,|€‘J(y)U'J(p) 

f,  =  <Vi°*v, 

si-0i.i0Prt 

u  €CR-unify(t .  s .) 
i  ’ '  I  r 

Using  this  notation,  we  know  <r  is  of  the  form: 

°i  =  ui°wM0 -Ow0- 

The  set  of  domain  variables  is  divided  into  and  V2:  we  will  consider  each  case  separately, 
first  performing  induction  on  i  where  and  then  where  >  ,£V2  We  show  that  o.  is 
parent  preserving  for  f,  s,  and  p. 

A.  VjCV,  =  3(pH I(y):  We  know  by  construction  of  V,  that  y\  {  =  v..  In  this  case,  each 
unifier  is  of  the  form  =  {v>o(  pi»t). 

i)  Basis:  If  i  =  0,  then  no  calls  to  CR-unify  have  been  made  and  termination  is  obvious. 
The  only  substitution  is  e0  =  t,  which  is  trivially  non  increasing  and  parent  preserv¬ 
ing. 

ii)  Inductive  step:  iX). 

ffj.,  i°...°«0  anc*  by  the  induction  hypothesis  on  i,  the  u  s  are  all  elementary 
substitutions  of  the  form  Wj  =  {v.«-a.  tpv .}.  Therefore,  3(a.  1)Q{v1 . r  }  and,  fur¬ 

thermore,  vj^(aj  j),  so  tj-aj ^yv.  =  a.  v(. 

Also,  by  construction  of  Vt  we  know  v^(p),  which  implies  >>.  C5(p)  and  thus 
VjC3(Oj ,).  Therefore,  vCTla.^pv.). 

Since  t. does  not  occur  in  s.  =  ff.pv.,  the  unifier  is  u>.  =  {t.«-sj}  =  {v^cr  ,  pv.  1},  which  is 
case  A  of  CR-variable-unify,  and  obviously  a  terminating  case. 


Uj,  is  homogeneous  in  F  and  elementary  non  increasing  for  oMf  and  <r 0s  because  it 
is  of  form  (3),  (4),  or  (5)  in  the  definition  of  elementary  non-increasing.  There¬ 
fore,  at  is  non  increasing  for  t  and  s  by  Lemma  18. 

i  occurs  under  F  in  either  t  or  s,  and  a.  is  homogeneous  in  F,  so  the  parent  preserv¬ 
ing  hypothesis  holds.  (Note:  this  is  true  even  if  v(  is  T  or  sj  as  in  case  B  of 
CR-variable-unity ,  since  ^  must  have  another  occurrence  under  F  in  the  non¬ 
variable  term.) 

B.  Vj€V2  =  9(y): 

i)  Basis:  The  proof  for  v  in  V1  proves  the  base  case  for  V2. 

ii)  Induction  Step: 

Since  v.€3(y)  we  know  y  v  and  thus  t.  is  a  non-variable  such  that  the  head  symbol  is 
not  in  F.  Therefore,  T  is  a  proper  significant  subterm  of  <r.  ^  or  <Xj  1s. 

There  are  five  cases  to  consider  for  s.. 

a)  If  s;  is  a  non-variable  such  that  srhead*Lheacl,  then  the  recursive  call  ter¬ 
minates  with  the  empty  set  in  case  4  of  CR-unify. 

b)  If  s.  is  a  variable  such  that  s.Cflt.)  &  sCSfPreserveff.,  /.)),  then  the  termination 
halts  with  the  empty  set  in  case  C  of  CR-variable-unify. 

c)  If  Sj  is  a  variable  such  that  SjCtty)  then  the  single  substitution  returned  by 
CR-unify(tv  s.)  is  Wj  = 

<jj  is  of  form  (2)  for  crj  ^  and  o^s  in  the  definition  of  elementary  non¬ 
increasing,  and  since  the  induction  hypothesis  on  i  implies  o.  ^  is  non¬ 
increasing  for  t  and  s,  <x.  =  ^  is  non-increasing  for  t  and  s. 

Termination  is  immediate  in  the  recursive  call  since  f.  and  Sj  fit  case  A  of 
CR-variable-unify  as  called  from  case  3  of  CR-unify. 


To  show  the  parent  preserving  hypothesis  on  a.,  we  will  consider  two  cases: 


the  variables  in  the  domain  of  and  the  variables  in  the  range  of  a.. 


Recall  the  variables  of  interest,  X,  are  either  variables  in  t  or  s  with  occur¬ 
rences  under  F,  new  variables  from  homogenization,  or  new  variables 


from  E.-unify. 


Consider  x  in  ^(a^,  and  show  that  a.x  will  occur  under  F  in  either  of  or 
^s.  This  is  evident  not  because  of  any  conditions  of  x ,  but  because 
yVj  occurred  under  F  in  t  or  s  and  therefore  a..*  =  Sj  =  o{  yv ( will  occur 
under  F  in  a  t  or  a  s. 

I  I 

Let  x  be  in  Jfcr)  and  show  that  x  will  occur  under  F  in  ot  or  os.  If 
jr€3 (cr)  then  either  x  €3{ct.  ?)  or  jr  £1^).  Variables,  other  than  those 
in  9(«j),  that  are  in  )(oi ,)  will  occur  under  F  by  the  induction 
hypothesis  on  i.  Variables  in  s(  =  <r ^yv.  are  either  in  f  or  s  or  in  jfa  1). 
In  the  first  case,  if  variables  occur  in  t  or  s  and  in  X,  then  they  must 
have  occurred  under  F  in  either  t  or  s.  In  the  second  case,  variables 
in  Oj ^  and  X  will  occur  under  F  in  o  ^  or  Oj by  the  induction 
hypothesis  and  thus  under  F  in  o7  or  OjS. 
d)  If  s  is  a  non  variable  such  that  s  .head  ~  t.head,  then: 

Since  [fj./ieadJ^F  we  know  [Sj.head]*F.  But  Sj  =  ‘ri.1P‘'j.  so  p  maps  ^  to  itself 
or  to  some  other  variable.  In  either  case  there  is  some  variable,  y^[oj  .) 
and  ,y€X  such  that  o{  ,y  =  sr  Therefore,  by  the  parent  preserving 
property  on  oM,  s(  occurs  under  F  in  either  a.  ,1  or  ot ^s. 

Therefore,  tj  and  s.  are  both  non-variables  significant  subterms  of  oMf  or 
o  .s  and  by  Lemma  15,  <r.,  s.>-</.  <0.  J,  o.  ,s>. 

1*1  I  I  V-r  1-1  1-1 

By  the  inductive  hypothesis  on  i,  <7M  is  non-increasing  for  t  and  s  and  thus 
<ffiV-  s>  ^  Lemma  18. 

By  transitivity,  </.,  s>.  I.e.,  the  recursive  call  is  made  to  strictly 

smaller  inputs  than  the  original  inputs,  and  we  can  apply  the  noetherian 
induction  hypothesis  to  prove  termination. 

By  the  induction  hypothesis  on  /.  and  s.,  if  Wj  is  a  returned  substitution,  then 
«j  is  non -increasing  for  t.  and  sf,  and  by  Lemma  19  for  t  and  s  since  t.  and 
Sj  both  occur  under  F. 

The  parent  preserving  property  is  proved  by  showing  that  if  x  €9{o\)U3(cr.) 
then  ffjjr  occurs  under  F  in  of  or  a.s.  Since  oj  =  uj°oj  r  x  is  either  in 
3{<7j _1)U3(ffj  t)  or  in  SJ(Wj)U3{«j).  In  the  first  case,  the  parent  preserving 
property  in  <7.  follows  directly  from  the  induction  hypothesis  on  o.  .  In  the 
second  case,  if  x  is  in  3(«j)  then  it  must  be  in  1.  or  s.,  and  if  x  is  in  5(«.) 


then  it  must  be  in  t ,  s  or  it  must  be  a  new  variable  and  therefore  not  be  in 

I  I 

X.  We  tnerefore  need  only  consider  the  variables  in  r  and  sf.  The  vari¬ 
ables  in  t  are  either  in  3(-y).  and  thus  in  t  or  s.  or  they  are  in  5(o  t);  the 
variables  in  are  all  in  3(a ,).  All  variables  in  3(0^,)  are  covered  by  the 
induction  hypothesis,  and  variables  in  t  or  s  occur  in  X  only  if  they  occur 
under  F,  so  w  and  hence  <r  are  parent  preserving. 

e)  If  S|  is  a  variable  such  that  s£ %\tt)  and  s$.5(Preserve(f.,  r),  then: 

Since  =  at  ,pi  is  not  in  t  or  s  but  is  in  r,  we  know  sjC5<cr._1). 

t(€f{s()  =>  f|C3(0)  ^p).  Therefore,  either  f(€3(p)  or  there  exists  some  v €3(p) 
such  that!  =  a  ,v. 

i  i-i 

In  either  case,  the  inductive  hypothesis  applies  to  give  the  parent  preserving 
property  on  a|  1  and  s-(i)  occurs  under  F. 

But.  also  occurs  under  [t.head].  Therefore.  size(ParSets(s.,  u| ^OU 
ParSeisls .  O'  s))  >  1  and  by  Lemma  16,  <f(,  s>-<Q<ai  ,f,  o(  ,,s>. 

By  the  same  arguments  as  in  case  4,  <r,  s> <f,  s>  and  we  can  use  the 
inductive  hypotheses  to  show  .,  is  parent  preserving  and  non¬ 

increasing  for  rand  s. 


Chapter  Four 
Conclusions 


This  chapter  summarizes  the  main  contributions  of  this  thesis  and  suggests  areas  for  future  work. 
We  look  specifically  at  possible  extensions  of  CR-unity  and  some  issues  and  related  to  the  complexity 
and  efficiency  of  our  approach. 


4.1  Contributions 

In  this  thesis  we  defined  a  generalization  to  the  equational  unification  problem  which  we  called 
combined  unification.  The  problem  is  to  take  a  set  of  equational  theories,  each  of  which  has  a  known 
equational  unification  algorithm,  and  automatically  produce  an  algorithm  for  the  union  of  the  theories. 
The  problem  solved  here,  where  we  restricted  ourselves  to  confined  regular  theories,  was  mentioned 
as  an  open  problem  by[Shostak  84]  as  a  generalization  to  his  work  in  combining  decision 
procedures.  It  is  also  described  as  an  open  problem  in  [Siekmann  84],  where  it  is  called  the 
combination  of  theories  problem. 

The  combined  unification  problem  was  motivated  both  by  a  theoretical  interest  in  learning  more 
about  equational  unification  and  a  pragmatic  interest  in  using  equational  unification  in  larger  systems 
such  as  theorem  provers.  Many  applications  of  unification  involve  reasoning  about  an  arbitrary  sets 
of  operators  with  different  set  of  equational  properties.  For  example,  in  an  automatic  theorem  prover 
the  set  of  needed  equational  theories  will  depend  on  the  theorem  to  be  proved.  Moreover,  given  the 
difficulty  of  designing  equational  unification  algorithms,  it  is  not  reasonable  to  assume  the  algorithm 
will  be  designed  "on  the  fly,"  while  proving  the  theorem.  These  considerations  lead  us  to  conclude 
two  things:  systems  based  on  equational  unification  must  have  a  large  set  of  built-in  theories,  and,  this 
set  must  be  easily  extendible  so  that  new  theories  can  be  incorporated  as  they  are  deemed  inter¬ 
esting. 

Our  approach  was  to  describe  a  unification  algorithm  for  equational  theories  that  are  the  union  of 
theories  having  known  unification  algorithms.  The  main  results  of  this  thesis  are: 

•  a  careful  definition  of  the  combined  unification  problem, 

•  characterization  of  a  sub-problem  of  combined  unification  for  which  a  solution  was  pos¬ 
sible,  and  for  which  extensions  outside  the  sub -problem  domain  are  difficult, 


■  design  of  an  algorithm  for  combining  equational  unification  algorithms  for  confined 
regular  theories, 

-  proofs  of  consistency,  completeness,  and  termination  for  the  algorithm, 

•  a  method  for  a  lifting  the  variable-only  case  to  the  case  with  free  symbols  and  multiple 
instances  using  our  combining  algorithm, 

-  an  implementation  of  the  algorithm. 

Our  definition  of  the  combined  unification  problem  is  based  on  the  existence  of  what  we  called  a 
partitioned  presentation  of  the  theory.  A  partitioning  on  the  presentation  characterizes  the  indepen¬ 
dence  of  its  sub-theories,  i.e.,  the  theory  presented  by  each  of  the  partitions.  This  intuitive  notion  of 
independent  sub-theories  corresponds  to  having  disjoint  sets  of  constrained  symbols  for  each  sub¬ 
theory.  Independence  is  essential  to  the  ability  to  automatically  combine  unification  algorithms  as 
evidenced  by  the  fact  that  combining  three  theories  with  decidable  unification  problems,  namely 
associativity,  left  distributivity  and  right  distributivity,  results  in  a  theory  with  an  undecidable  unifica¬ 
tion  problem.  (See  Figure  1-4  for  references.) 

The  sub  problem  of  combined  unification  that  we  chose  to  solve  was  combining  confined  regular 
theories.  Both  of  these  properties  give  sufficient  conditions  for  finding  pairs  of  terms  that  are  not 
unifiable.  We  defined  a  confined  theory  to  be  one  in  which  there  are  no  equations  with  a  variable 
equal  to  a  non-variable  term.  This  gave  us  the  invariant  that  any  two  equal  non-variable  terms  have 
head  symbols  constrained  by  the  same  sub-theory.  A  regular  theory  is  a  theory  in  which  the  right  and 
left  side  of  each  equation  contains  the  same  set  of  variables.  Together  with  the  restriction  to  confined 
theories,  the  regularity  restriction  guarantees  that  every  pair  of  equal  terms  has  equal  sets  of  sig¬ 
nificant  subterms. 

We  also  showed  in  Section  2.4  that  confinement  and  regularity  were  not  restrictions  of  con¬ 
venience,  but  that  both  our  algorithm  and  a  number  of  simple  extensions  to  it  were  not  correct  for 
unconfined  or  non-regular  theories.  Taken  collectively,  the  examples  in  Section  2.4  point  out  a 
conflict  between  the  goals  of  completeness  and  termination  of  E-unification  procedures.  In  many 
cases  it  was  possible  to  either  prove  completeness  assuming  termination  or  to  prove  termination  of  an 
incomplete  procedure. 

The  examples  in  Section  2.4  also  serve  as  counter-examples  to  the  correctness  of  some  existing 
E  unification  algorithms  when  more  than  one  instance  or  free  symbols  are  allowed.  The  abelian 
group  unification  algorithm  and  the  algorithm  for  AC  unification  with  either  idempotence  or  unit  are 
examples  of  algorithms  that  are  not  correct  in  the  more  general  cases.  The  ACI  and  ACU  theories 


have  vanab'e  only  case  algorithms  defined  in  [Livesey  76).  [Fages  83b)  shows  termination  for  the 
general  cases  of  ACI  and  ACU  but  neither  his  algorithm  nor  the  algorithms  in  [Livesey  76]  are  com¬ 
plete  in  these  cases,  in  particular,  they  do  not  find  a  unifier  when  unifying  a  variable  with  a  non- 
variable  term  containing  the  variable  For  the  same  reason,  the  algorithm  for  abelian  group  unifica¬ 
tion  (Lankford  84]  is  incomplete  in  the  general  cases. 

Our  algorithm  is  a  combining  algorithm  for  the  confined  regular  theories.  It  automatically  builds 
an  algorithm  for  the  combined  unification  problem  by  dividing  the  combined  problem  into  a  number  of 
variable-only  problems,  invoking  algorithms  for  the  sub-theories  on  those  problems,  and  recombining 
the  answers.  A  homogenizing  function  is  used  for  the  problem  division  and  unification  of  substitu¬ 
tions  used  for  the  recombining.  The  technique  requires  no  redesign  of  the  sub-theory  algorithms,  and 
no  theory-specific  computation  in  the  generalized  algorithm.  The  restrictions  of  regularity  and  con¬ 
finement  are  implicitly  used  for  correctness,  but  are  never  explicitly  tested  by  the  algorithm. 

The  practical  significance  of  our  results  depend  upon  the  existence  of  confined  regular  theories 
for  which  the  combined  unification  problem  was  previously  unsolved,  and  for  which  the  theories 
describe  "interesting"  properties.  The  permutative  theories  are  examples  of  such  theories.  The 
insert  operation  on  the  set  data  type,  for  example,  has  the  property  of  right  commutativity,  a  permuta¬ 
tive  property  that  also  has  known  unification  algorithm. 

An  algorithm  for  unifying  permutative  axioms  is  useful  in  term  rewriting  system  completion 
procedures  and  in  resolution,  because  both  of  these  procedures  make  use  of  classical  unification  and 
are  limited  by  the  necessity  of  maintaining  termination.  The  permutative  axioms  are  a  class  that  will 
always  lead  to  termination  problems,  since  they  can  be  applied  repeatedly.  The  non  permutative 
axioms  can  often  be  handled  directly  by  the  application,  such  as  resolution  or  term  rewriting,  without 
loosing  termination.  It  is  often  the  case  that  equations  leading  to  termination  probiems  in  an  applica¬ 
tion  can  be  handled  by  combined  unification,  while  the  equations  leading  to  problems  in  combined 
unification  can  be  handled  by  the  application.  Referring  again  to  the  set  example,  we  note  that  most 
axioms  for  this  data  type  can  be  handled  in  a  term  rewriting  system  by  placing  them  in  the  rules.  An 
exception  is  the  right  commutativity  of  insert,  which  would  violate  properties  necessary  for  termina¬ 
tion.  Right  commutativity  can  be  handled  by  our  CR-unify  algorithm,  and  is  allowed  in  the  unification 
algorithm  in  an  equational  tern  rewriting  system.  We  therefore  have  a  method  for  proving  theorems 
about  sets  of  terms  or  sets  of  sets  by  incorporating  the  combining  unification  algorithm  with  right 
commutative  unification  into  an  equational  term  rewriting  system. 

Another  class  of  theories  we  have  discussed  are  those  theories  presented  by  only  ground  equa- 
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tions.  These  theories  have  an  automatically  generated  unification  algorithm  through  narrowing 
[Hullot  80],  and  are  also  confined  and  regular.  While  the  case  with  free  symbols  and  multiple  in¬ 
stance  case  are  solved  by  the  narrowing  process,  our  algorithm  provides  a  method  for  combining 
these  narrowing  unification  algorithms  with  arbitrary  other  algorithms  for  confined  regular  theories. 

A  proof  of  total  correctness  for  the  CR-unify  algorithm  was  given  in  Chapter  3.  The  proof  was 
presented  in  three  separate  theorems:  Theorem  III  shows  consistency,  Theorem  IV  completeness, 
and  Theorem  V  termination.  The  consistency  and  termination  properties  are  proved  for  all  theories, 
while  completeness  is  proved  for  the  regular  confined  theories.  The  consistency  and  completeness 
proofs  assume  termination,  and  therefore  constitute  a  partial  completeness  argument.  Combined 
with  the  termination  proof  this  shows  total  correctness.  The  termination  proof  wa  j  ->ased  on  Fages’s 
proof  of  termination  for  AC-unification.  However,  the  extension  was  non-trivial,  particularly  because 
case  B  of  CR-unify  is  a  failure  case  in  the  AC  theory,  whereas  case  B  is  a  recursive  case  in  CR-unify  to 
handle  theories  presented  by  ground  equations. 

A  consequence  of  our  method  and  the  proofs  is  that  any  variable-only  case  algorithm  for  an 
equational  theory  E  can  be  automatically  lifted  to  both  the  multiple  instance  case  and  case  with  free 
symbols.  The  importance  of  this  is  demonstrated  by  considering  an  example.  A  unification  algorithm 
for  the  AC  theory  was  originally  described  by  Stickel  in  1975,  and  although  the  procedure  was  pur¬ 
ported  to  solve  the  general  cases  for  AC,  termination  in  the  general  case  was  unproven  until  1983, 
when  it  was  solved  by  Fages.  The  AC  theory  is  one  example  of  a  confined  and  regular  theory;  Fages’s 
generalization  of  Shekel's  algorithm  is  a  special  case  of  our  CR-unify  just  as  his  termination  proof  is  a 
special  case  of  our  termination  proof  for  CR-unify.  The  unification  problem  with  free  symbols  is  also 
immediately  solved  by  our  algorithm  for  any  theory  with  a  variable-only  case  algorithm,  because  the 
unconstrained  symbols  are  handled  by  the  empty  theory  unification  algorithm. 

The  ability  to  go  directly  from  a  variable-only  case  solution  to  the  more  general  solutions  also  give 
us  some  bounds  on  the  number  of  unifiers  in  certain  theories.  As  a  corollary  to  our  proof  of  total 
correctness,  termination  in  the  variable-only  case  of  a  confined  and  regular  theory  implies  termination 
in  both  the  multiple  instance  case  and  the  case  with  free  symbols  of  this  theory.  This  in  turn  implies 
the  existence  of  a  finite  complete  set  of  unifiers  in  these  more  general  cases.  Conversely,  if  either  the 
case  with  free  symbols  or  the  multiple  instance  case  is  known  to  be  infinite  for  a  given  theory,  then  the 
variable  only  case  must  also  be  infinite. 

The  CR-unify  algorithm  has  been  implemented  as  part  a  general  effoit  to  extend  the  REVE  term 
rewriting  system  generator  [Lescanne  83,  Forgaard  84b]  to  equational  term  rewriting  systems 


[Kirchner  84b].  The  implementation  supports  the  generalized  unification  algorithm  and  allows  for 
simple  modular  extension  to  new  sub-theories  as  their  unification  algorithms  are  implemented.  In  the 
current  version,  the  unification  algorithms  for  the  AC  and  empty  theories  have  been  implemented. 
The  implementation  of  the  REVE  system,  including  our  unification  algorithm,  was  done  in  CLU  [Liskov 
81]. 

For  efficiency  reasons,  the  implementation  differs  from  the  description  given  in  this  thesis.  Each 
E-unification  procedure  is  implemented  to  perform  unification  in  a  single  equational  theory,  making 
no  assumptions  about  the  properties  of  the  operators  in  the  subterms,  but  recursively  calling  the 
top-level  general  unification  procedure  to  unify  subterms  rather  than  returning  homogeneous  sub¬ 
stitutions  and  combining  them  through  unification  of  substitutions.  This  eliminates  the  overhead  of 
forming  the  homogeneous  terms.  The  sets  of  substitutions  that  appear  in  both  CR-unify  and 
map-unity  are  replaced  in  the  implementation  with  CLU  iterators.  This  simplifies  the  code  and  may 
allow  for  combining  non-terminating  unification  procedures,  such  as  that  for  the  associative  theory 
[Plotkin  72],  Finally,  as  in  Fages’s  implementation  of  AC-unification,  the  order  of  recursion  imposed 
in  map-unity  by  forming  V(  and  V2  is  not  preserved.  I.e.,  variable  elements  in  %(y)  are  not  necessarily 
unified  first.  The  assumption  on  the  order  of  recursion  is  made  only  to  simplify  the  termination  proof. 

4.2  Future  Work 

The  work  in  this  thesis  has  suggest  some  areas  for  further  research.  We  will  discuss,  on  a 
pragmatic  level,  some  techniques  for  improving  the  efficiency  of  our  algorithm.  We  also  consider  the 
problem  of  weakening  the  restrictions  on  the  equational  theories  allowed  in  the  combining  algorithm, 
and  present  directions  for  further  work  in  this  area. 

4.2.1  Efficiency  Issues 

The  feasibility  of  using  E-unification  in  applications  will  depend  in  part  on  the  ability  to  find 
reasonably  efficient  algorithms  for  performing  the  unification.  Although  the  combined  unification 
problems  are  inherently  hard  for  many  interesting  theories,  there  are  a  number  of  optimizations  that 
will  improve  the  running  time  in  practice. 

One  of  these  optimizations  comes  from  the  difference  between  U-Homog  and  Homog,  defined  in 
this  thesis.  CR-unify  is  still  correct  if  U-Homog  or  other  homogenizing  functions  are  used  in  place  of 
Homog  in  the  procedure.-  There  is  a  trade-off  between  the  efficiency  of  the  homogenizing  function 
and  the  number  of  unifiers  of  the  homogenized  terms,  since  terms  with  multiple  occurrences  of 


vanables  have  fewer  unifiers  than  terms  in  which  each  variable  is  unique.  The  best  homogenizing 
function  may  be  one  that  performs  differently  depending  on  the  theory  of  the  head  symbols. 

The  order  of  recursive  calls  is  also  very  important  to  the  running  time  of  combined  unification.  In 
practice  it  is  best  to  perform  the  simplest  recursive  calls  first,  especially  when  they  will  lead  to  failure. 
Ordering  of  recursive  calls  can  be  done  partially  on  the  basis  of  the  kind  of  terms  to  be  unified  (i.e., 
variable  versus  non  variable)  and  partially  on  the  basis  of  the  relative  difficulty  of  performing  a  unifica¬ 
tion  in  the  different  theories  [Fages  85], 

Perhaps  the  most  interesting  class  of  optimizations  would  involve  weakening  our  strict  boundaries 
between  sub  theory  unification  algorithms.  There  are  sufficient  conditions  on  terms  for  non- 
unifiability  that  can  be  checked  very  quickly,  e  g.,  clash  of  head  symbols.  Sometimes,  a  sub-theory 
unification  algorithm  can  make  use  of  this  information  during  its  processing  and  thereby  never 
produce  unifiers  that  would  require  two  obviously  non-unifiable  terms  to  be  unified.  A  clash  between 
symbols  from  differe;  *.  sub  theories  could  be  detected  in  the  current  structure  of  the  algorithm,  but 
more  general  kinds  of  checks  for  non-unifiability  are  specific  to  a  theory  and  would  therefore  requires 
sharing  information  between  unification  algorithms.  Our  current  implementation  does  not  make  use 
of  this  kind  of  information  because  the  emphasis  was  on  correctness  and  modularity  of  the  program 
rather  than  efficiency. 

A  measure  of  complexity  that  exists  for  unification  problems,  and  is  also  related  to  the  efficiency  in 
practice,  is  minimality  of  a  solution.  An  algorithm  that  is  totally  correct  but  produces  many  non- 
minimal  unifiers  will  be  too  inefficient  for  some  applications.  Minimality  is  probably  too  expensive  to 
require  of  unification  algorithms,  since  in  some  theories  this  would  require  an  exponential  filtering 
process  However,  non-minimal  unifiers  affect  the  execution  time  of  both  the  algorithm  and  its  ap¬ 
plications  and  thus  one  measure  of  a  good  algorithm  should  be  that  it  produces  few  non-minimal 
unifiers.  A  related  open  problem  is  to  find  a  minimal  combining  unification  algorithm  that  avoids 
exponential  filtering. 

Efforts  to  gain  significant  improvements  in  efficiency  through  parallel  processing  are  limited  by 
some  lower  bounds  in  that  area.  First,  note  that  E-unification  problems  for  which  the  best  algorithm  is 
exponential  will  have  at  best  exponential  parallel  algorithms,  since  we  have  only  a  polynomial  number 
of  processors.  For  theories  in  which  polynomial  unification  algorithms  exist,  improvements  through 
parallel  processing  may  be  possible.  However,  the  fact  that  empty  theory  unification  is  inherently 
sequential  [Dwork  84]  is  not  promising. 
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4.2.2  Removing  Restrictions  on  the  Theories 

A  challenging  area  of  research  that  is  not  addressed  in  this  thesis  is  the  problem  of  combining 
theories  in  which  constrained  symbols  are  not  disjoint.  Some  of  the  negative  results  on  the 
decidability  of  unification  problems  indicate  the  difficulty  of  this  problem.  The  undecidability  results  in 
this  area  have  been  based  on  the  undecidability  of  Hilbert’s  10th  problem,  solving  Diophantine  equa¬ 
tions  over  the  integers,  which  was  shown  undecidable  by  MatiyaseviC  [Davis  73].  [Arnborg  85]  and 
[Szabo  78]  investigate  combinations  of  theories  with  non-disjoint  function  symbols  by  studying  the 
lattice  of  sub-theories  that  are  consistent  with  Peano  arithmetic.  [Szabo  78]  shows  the  undecidability 
of  the  associative  theory  with  two  sided  distributive,  AD,  while  [Arnborg  85]  shows  the  undecidability 
of  the  associative  theory  with  one-sided  distributive  and  a  right  and  left  identity  element,  AD,U  or 
ADfU.  In  both  cases  it  was  also  shown  that  any  theory  consistent  .vith  Peano  arithmetic  and  contain¬ 
ing  the  AD  or  AD^  theory,  respectively,  also  has  an  undecidable  unification  problem. 

The  CR-unify  described  in  this  thesis  cannot  handle  non-terminating  procedures  for  enumerating 
unifiers.  For  the  sake  of  notational  convenience,  the  CR-unity  invokes  a  sub-theory  algorithm  which 
returns  a  complete  set  of  unifiers,  although  processing  of  unifiers  could  be  done  one  at  a  time.  It  may 
be  possible  to  get  a  combining  algorithm  for  non-terminating  unification  procedures  by  processing  on 
sub-theory  unifier  before  the  others  are  generated.  The  interesting  problem  here  is  to  show  that  the 
resulting  procedure  is  a  complete  generating  procedure  for  the  combined  theory. 

An  obvious  problem  that  is  left  open  in  this  thesis  is  combining  unification  algorithms  when  the 
theories  may  be  either  unconfined  or  non-regular.  The  problems  that  arise  in  trying  to  extend 
CR-unify  were  characterized  by  the  examples  in  Section  2.4.  We  found,  in  general,  that  it  was  not 
hard  to  guarantee  consistency,  but  that  the  properties  of  termination  and  completeness  seem  to 
conflict.  It  was  possible  to  get  a  provably  complete  procedure  if  one  assumed  termination,  while  in 
actuality  the  procedure  would  loop  in  a  trivial  manner  before  any  unifiers  were  generated.  Alter¬ 
natively,  a  terminating  algorithm  could  be  achieved,  but  it  was  found  that  the  algorithm  was  incom¬ 
plete  on  some  non  trivial  examples. 

Before  our  algorithm  can  be  extended  to  unconfined  or  non-regular  theories,  further  theoretical 
work  must  be  done.  The  experience  gained  in  this  work  shows  that  the  problem  of  generalizing 
unification  procedures  is  not  trivial,  that  seemingly  obvious  approaches  are  not  always  correct,  and, 
therefore,  that  algorithms  in  this  field  require  detailed  descriptions  and  careful  proofs  of  consistency, 
completeness,  and  termination. 


Appendix  A 

Protection  of  Variables  in  CR-unify 


A  technical  issue  that  we  have  avoided  discussing  in  detail  until  now  is  the  generation  and  protec¬ 
tion  of  new  variables  in  CR-unify.  A  unification  algorithm  is  often  used  in  a  larger  system,  and  that 
system  may  have  variables  of  its  own,  and  it  is  important  that  any  new  variables  generated  by  CR-unify 
do  not  coincide  with  those  existing  externally.  This  problem  could  be  handled  by  simply  renaming 
variables  after  performing  unification,  but  the  more  general  problem  comes  up  within  the  CR-unify 
algorithm  because  of  the  recursion.  It  is  important  to  all  three  correctness  properties,  i.e.,  consis¬ 
tency,  completeness  and  termination,  that  new  variables  generated  on  recursive  calls  do  not  coincide 
with  those  existing  in  subterms  not  involved  in  the  recursion.  For  example,  if  two  new  variables 
appear  together  and  both  use  the  same  name,  the  resulting  unifier  may  be  less  general  than  intended 
substitution.  In  particular,  the  property  of  idempotence  of  unifiers,  used  in  the  proofs,  depends  on  the 
disjointness  of  domain  and  range  variables.  This  disjointness  can  only  be  guaranteed  if  the  variables 
in  the  domain  of  one  factor  of  a  substitution  can  be  protected  from  appearing  in  the  range  of  another 
factor.  The  parent  preserving  property  of  the  termination  proof  also  depends  on  the  protectiveness  of 
recursive  calls  to  CR-unify. 

In  the  implementation  the  protection  problem  corresponds  to  the  problem  of  generating  globally 
unique  identifiers  from  within  any  local  procedure  environment.  The  solution  in  the  implementation  is 
to  pass  an  object  for  generating  unique  identifiers  to  each  unification  procedure  and  to  guarantee  a 
priori  that  all  variables  in  the  input  are  disjoint  from  any  variables  that  may  be  generated.  We  do  this 
by  picking  a  special  prefix  for  generated  variables  and  concatenating  a  unique  integer  whenever  a 
new  variable  is  needed.  In  the  formal  context,  it  is  more  convenient  to  pass  the  set  of  variables  to  be 
protected  than  to  pass  a  function  for  generating  id  ntifiers,  although  the  two  approaches  are  effec¬ 
tively  the  same. 

We  begin  by  imposing  a  total  order  on  the  universe  of  variables,  V.  If  V  is  any  set  of  variables, 
Wext(V)  denotes  the  smallest  variable  in  V,  as  defined  by  the  imposed  ordering.  The  homogenizing 
function,  Homog ,  is  then  modified  to  incorporate  the  protected  set  of  variables.  The  following 
modification  to  the  definition  of  Homog  will  legitimize  calling  Homog  a  function,  since  it  will  now  be 
mathematically  well-defined.  The  definition  given  here  replaces  the  earlier  one. 


Definition  Let  F  be  a  set  of  function  symbols.  W  be  a  finite  set  of  variables,  and  t  be  a  term  such  that 
HO  C  VV.  Horn og(;.  F,  W)  is  defined  as  follows: 


1.  If  r  is  a  variable,  then  Homog(t,  F, )  =  t. 

2.  If  t  =  /(/, . gand/€F,  then 

Homog(t.  F,  W)  =  f(Homog(ty  F,  Wj) . Homog{tn,  F,  Wn)),  where  W1  =  W  and  for  i  >  1 , 

VV  =  W.1UHHomo9(f.1.F.Wj1)). 

3.  If  f  =  /(f, . rj  and  /  €  F,  then  Homog(t,  F,  W)  =  We*f(V-W). 

Figures  4-1  and  4-2  show  the  CR  unify  and  map-unity  procedures,  respectively.  For  the  sake  of 
consistency  between  Figure  2-3  and  4-1,  we  will  abuse  our  notation  slightly.  The  shorthand  form  of 
the  homogenizing  function,  f,  was  used  in  the  less  formal  description  of  CR-unity  in  Figure  2-3. 
Because  it  does  not  allow  for  specification  of  the  the  set  of  protected  variables,  we  will  use  the 
longhand  form,  Homog.  and  use  f  and  s  as  identifiers  in  the  code;  they  represent  the  same  values  as 
in  Figure  2-3. 

For  simplicity  we  assume  the  set  of  protected  variables,  W,  contains  all  variables  in  the  other 
inputs  arguments  l.e.,  HOUTfs)  C  W  in  calls  to  the  CR-unify  procedure,  C  W  in 

CR-variabie-unify,  and  rJ(<p1)U3(gp1)U£f(gp2)U3(qp2)  CW  in  map-unify. 


CR-unity  =  proc  (f,  s:  term,  W:var_set)  returns  (subst_set) 


case 

is_variable(f)  and  is_variable(s)  =>  %  case  1 

return({{/«-s}}) 

is_variable(/)  and  ~is_variable(s)  =>  %  case  2 

ret\iTt\(CR-variable-unify{t,  s,  W)) 
is_variable{s)  and  ~is_variable(f)  =>  %  case  3 

return [CR-variable-unify(s,  t,  W)) 
t.head  *  s.head  =>  %  case  4 

n 

return  (0) 

t.head  =  s.head  =>  %  case  5 

f :  =  Homog(t,  t.head,  W) 
s  :  =  Homog(s,  t.head,  WUI^O)- 


y  :  =  Preserve (t,  t)  U  Preserved,  S) 

P :  =  Eithead]uni,y{t’ 1  WU'HOUns)) 

2 :  =  U  map-unify(p,  y,  WU'H?)U,HS)UJ(p)) 

P€F 

return(2|rwut?j)) 

end 

end 

end  CR-unity 

CR-variable-unify  =  proc  (v:  variable, s:  term,  W:var_set)  returns  (subst_set) 
s  :  =  Homog(s,  s.head,  W) 
y  :  =  Preserved,  s) 
case 

»{%)=»  %  case  A 

return  ({{v  ♦■s}}) 

V  €  r\s)  &  V  i  3(y)  =*  %  case  B 

P:  = 

2:=  U  map-unify[p,  y,  WUfIs)U5(p)) 

P€P 

return(2|{v}u1fls)) 

v  €  t(s)  &  v  €  J(y)  =>  %  case  C 

return({}) 
end 

end  CR-variable-unify 

Figure  4-1 :  A  Careful  Description  of  the  CR -unify  Procedure 


map-unity  =  proc  (<pv  <p2:subst,  W:var_set)  returns(subst_set) 
Vi:=9(9))-3(T2) 

V2:=3(v2) 

20=(0 
*  :=  0 
WQ:  =  W 
for  j  =  1  to  2  do 
for  v  in  V.  do 
i :  =  i  +  1 

&  u.€CR-unify(o.  ^  v ,  a.^<p2v ,  W.)} 

end 

returntZ,) 
end  map-unify 

Figure  4-2:  A  Careful  Description  of  the  map-unify  Procedure 


We  will  use  an  inductive  argument  to  show  the  protectiveness  of  CR-unify.  Recall  that  the  con¬ 
ditions  comprising  protectiveness  of  a  set  2  of  unifiers  of  f  and  s  are: 

Va€Z  5f(<r)CV  &  W-VnJ(o)  =  0 

&3(o)nj(o)  =  0 

where  V  =  Tl[f)UT(s).  The  protectiveness  of  each  step  depends  on  W  containing  all  variables  in  the 
input  arguments,  so  it  is  important  that  recursive  calls  maintain  this  convention. 

The  applications  of  Homog  to  t  and  s  in  both  CR-unify  and  CR-variable-unify  of  Figure  4-1  meet 
this  requirement,  since  TfOUT^sJCW.  Calls  to  the  sub-theory  unification  algorithm  case  5  and  case  B 
explicitly  add  any  new  variables  from  f  and  s  to  W.  The  only  non-trivial  case  is  the  invocation  of 
map-unify,  and  here  we  know  by  construction  of  y  that  all  variables  in  3(y)U5(y)  are  in  either  T[t), 
Y\s),  l^f),  or  'Hs).  Furthermore,  by  protectiveness  of  the  sub-theory  unification  algorithm, 
^pCTf^UTfs).  By  explicitly  adding  3{p)  as  well  as  T^f)  and  T|s)  to  W,  we  are  guaranteed  to  cover  all 
variables  in  both  y  and  p.  Note  that  the  protected  variables  in  calls  to  one  invocation  of  map-unify  do 
not  contain  variables  generated  from  a  previous  invocation  of  map-unify. 

Map-unify  also  maintains  the  convention  of  having  all  variables  from  the  arguments  in  the 
protected  set,  since  it  explicitly  places  any  new  variables  from  one  invocation  into  the  protected  set 
for  the  next  invocation. 


Given  this  assumption  on  W,  we  can  see  that  CR-unify  produces  a  protective  set  of  unifiers. 
Cases  1 .  2,  3.  4.  A.  and  C  are  obvious.  In  cases  5  and  B,  if  a  is  a  returned  unifier,  then  <2{o)Cf\t)UY\s) 
by  the  explicit  restriction  of  I.  The  two  properties  on  the  range  of  a,  (W-V)n5(o)  =  0  and 
cJ(a)fU(o)  =  0  follow  because  a  is  the  composition  of  substitutions  formed  by  recursive  calls  to 
CR-unify.  and  in  each  case  W  is  a  subset  of  the  protected  set  in  the  recursive  call. 
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Appendix  B 

Glossary  of  Terms 


..(-assignment  a  mapping  from  terms  to  elements  of  algebra  JL. 

algebra  A  set  of  elements  and  a  set  of  function  on  the  elements. 

carrier  The  set  of  elements  in  an  algebra. 

clash  In  classical  unification,  this  is  the  problem  that  occurs  when  trying  to  unify  terms 

with  different  head  symbols. 

congruence  relation 

An  equivalence  relation  closed  under  the  equality  rule,  r1 — s1,...,  fn~sn  => 
/(f1 . fn)~/(s, . sn)  for  all  /  G  F  of  arity  n. 

complete  set  of  unifiers 

A  generating  set  for  the  set  of  all  unifiers. 

completeness  The  property  on  unification  algorithms  that  guarantees  a  complete  set  of  unifiers 
is  always  found. 

confined  A  set  of  equations  is  confined  if  is  contains  no  equations  with  a  variable  equal  to  a 

non-variable  term. 

consistency  The  property  on  unification  algorithms  that  guarantees  all  returned  substitutions 
are  unifiers. 

constants  Function  symbols  of  arity  0,  denoted  a,b,c,d,0,or}. 

corresponding  pair  The  pair  of  l.rms  with  which  CR-uni/y  is  invoked  from  within  map-unify.  The  pair 
is  formed  by  picking  a  variable  in  the  domain  of  the  two  substitutions,  applying 
each  substitution  to  the  variable,  and  then  applying  any  previously  accumulated 
substitution. 

cycle  The  problem  that  occurs  in  classical  unification  when  unifying  a  variable  with  a 

term  containing  that  variable.  More  generally  in  equational  unification,  this  hap¬ 
pens  whenever  the  unification  is  an  infinite  term. 

domain  The  domain  of  a  substitution,  denoted  9(<r)  is  the  set  of  all  variables  mapped  to 

something  other  than  themselves,  i.e.,  3( a )  =  {v|av*v}. 

elementary  substitution 

Substitution  with  a  domain  of  size  1  or  0. 


elementary  non-increasing 

A  fairly  technical  definition  giving  sufficient  conditions  on  elementary  substitu¬ 
tions  such  that  the  complexity  (-<q)  of  two  terms  is  not  increased  by  applying  the 
substitution. 

ground  terms  The  set  of  terms  formable  from  only  function  symbols,  i.e.,  no  variables, 
head  The  leftmost  symbol  of  a  term. 

homogeneous  A  term  is  homogeneous  with  respect  to  a  set  of  function  symbols  is  the  term 
contains  no  function  symbols  outside  that  set.  A  substitution  is  homogeneous  if 
all  terms  in  its  range  are  homogeneous  with  respect  some  set  of  function  symbols. 
(See  Homog ,  U-Homog,  and  U-HomogMap .) 


Homog 


instance 


instantiation 


map-unify 


match 


The  homogenizing  operation  on  terms,  denoted  (f),  that  replaces  subterms  with 
new  variables. 

A  term  t  is  an  instance  of  s  is  t  =  as  for  some  substitution  a.  Similarly,  a  substitu¬ 
tion,  <p1  is  an  instance  of  <p2  if  and  only  if  there  exists  <p3  such  that  <p1  =  <P30<P2'  V2 
is  said  to  be  more  general  than  <p2  in  this  case,  and  the  partial  order  on  substitu¬ 
tions  is  denoted  <p1  S 

A  rule  of  inference  used  in  equational  logic  From  t  =s  deduce  at  =  as. 

A  procedure  for  finding  unifiers  of  substitutions. 

A  substitution  mapping  a  term  to  an  instance  of  itself. 


minimal  complete  set  of  unifiers 

An  set  of  substitutions  that  generates  all  unifiers  and  contains  no  redundant  sub¬ 
stitutions. 

minimal  partition  The  smallest  partition  on  a  presentation  of  a  theory  that  preserves  disjointness  of 
operators. 

minimality  A  property  on  unification  algorithms  that  guarantees  no  returned  substitution  is  an 

instance  of  another  returned  substitution. 

more  general  modulo  E 

A  partial  order  on  substitutions,  denoted  <  ,  that  is  similar  to  £  except  =  is  used 
in  place  of  term  equality.  E  E 

most  general  unifier 

In  classical  unification  this  is  the  unique  unifier  of  which  all  other  unifiers  are 
instances.  In  E-unification  there  may  be  a  set  of  most  general  unifiers. 

non  confining  An  equation  is  non  confining  if  it  is  of  the  form  v  =f  or  t  =  v,  where  t  a  non  variable 
term. 

non-increasing  A  substitution  formed  of  only  elementary  non- increasing  factors. 
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occurrence 


A  of  integers,  o,  denoting  node  within  a  term.  For  example,  /(/,,. ..,fn)/i.o  =  t  /o, 
where  t/o  denotes  the  term  at  occurrence  o  in  t. 

parent  A  parent  of  i  in  s  is  an  operator  in  s  having  t  as  an  argument.  The  set  of  all  parents 

of  t  in  s  is  written  Parents(t,s). 

parent  set  The  equivalence  of  a  parent.  The  set  of  all  parents  sets  of  t  in  s  is  written 

ParSets(t,s). 

partitioned  presentation 

A  set  of  sets  of  axioms  with  pairwise  disjoint  operators, 
presentation  A  set  of  axioms  for  an  equational  theory, 
preserving  substitution 

The  substitution  mapping  f  to  t,  which  is  just  the  match  of  f  by  f. 
proper  occurrence  An  occurrence  other  than  the  empty  occurrence,  e. 

protection  A  property  on  unification  algorithms  limiting  the  variables  that  can  appear  in  an 

answer,  i.e.,  all  answers  must  be  protective  unifiers. 

protective  unifier  A  unifier,  a,  of  t  and  s  is  protective  if  is  domain  and  range  contain  disjoint  sets  of 
variables,  the  domain  is  a  subset  of  the  variables  in  t  and  s,  and  any  variables  in 
the  range  of  a  than  do  not  occur  in  t  or  s  are  new  variables. 

quotient  algebra  The  algebra  formec  from  another  algebra  by  taking  equivalence  classes  of  ele¬ 
ments  of  the  second  as  elements  of  the  first.  For  example,  tf/  =  denotes  the  term 
algebra  modulo  an  equational  theory  E*.  E 

range  The  range  of  a  substitution  is  the  set  of  terms  to  which  some  element  of  the 

domain  is  mapped. 

regular  An  equation  is  regular  if  the  right  and  left  sides  contain  identical  sets  of  variables. 

A  set  of  equations  is  regular  if  every  element  is  regular. 

relevant  function  symbols 

The  set  of  function  symbols  constrained  by  a  sub-theory.  In  most  cases  this  is  the 
set  of  function  symbols  in  a  presentation  of  the  theory  given  in  w;  in  the  case  of 
the  empty  theory,  this  is  the  set  of  symbols  that  do  not  appear  in  any  axioms. 

significant  occurrence 

An  occurrence,  o  is  significant  if  the  occurrence  just  above  it  does  not  have  the 
same  operator  as  o,  where  same  means  = .  All  variable  occurrences  as  well  as  the 
empty  occurrence  are  significant.  ” 

significant  subterm  A  subterm  is  significant  if  it  occurs  at  a  significant  occurrence. 


strict  occurrence  A  non-variable  occurrence. 
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strictly  consistent  A  theory  is  strictly  consistent  if  and  only  if  x  =  i  is  not  in  the  theory.  A  theory  that 
is  not  strictly  consistent  contains  all  equation!. 

strict  theory  A  theory  is  strict  if  for  any  set  of  unificands  having  a  unifier  in  common,  the 
transitive  closure  of  -<  is  a  strict  ordering  on  the  set. 

strongly  complete  theory 

A  theory  is  strongly  complete  if  for  any  pair  of  terms,  a  variable,  *,  and  non¬ 
variable.  r.  if  >  and  t  are  unifiable  then  there  is  complete  set  of  unifiers  such  that 
every  substitution  in  the  set  has  a  domain  of  {.t }. 

sub-theory  If  E|  is  an  element  of  it.  then  E*  is  a  sub  theory  of  E*. 

substitution  A  mapping  from  variables  to  terms  extended  to  a  mapping  from  terms  to  terms. 

subterm  ordering  Denoted  t-<s,  this  partial  ordering  on  terms  holds  when  t  is  a  subterm  of  s. 

theory  A  set  of  equations  that  is  closed  under  rules  of  inference. 

U  Homog  A  homogenizing  function  on  terms  that  replaces  subterms  with  elements  of  U 

rather  than  new  variables  as  in  Homog.  7  is  used  when  the  value  of  F  is  clear  from 
context. 

U  HomogMap  A  homogenizing  function  on  substitutions,  denoted  o,  that  homogenizes  each 
term  in  the  range  and  maps  some  elements  of  U  to  others. 

unifier  In  the  classical  case,  a  substitution,  o.  is  a  unifier  of  two  terms  f  and  s  if  and  only  if 

ot  =  os.  In  the  more  general  equational  unification,  term  equality  if  replace  by 
equality  in  an  equational  theory. 

CR-unify  An  algorithm  for  unifying  in  confined  regular  theories. 

uninterpreted  Function  symbols  that  do  not  appear  in  a  presentation  of  a  theory  and  unin¬ 
terpreted. 

universal  E-preserving  substitution 

The  substitution,  ji,  that  maps  the  homogeneous  form  of  a  term  under  U-Homog  to 
a  term  within  E  of  the  original. 

variables  The  universe  of  variables  is  denoted  V  while  individual  variables  are  denoted  by 

either  u,  v,  w,  x,y,or  z. 


Appendix  C 
Special  Symbols 


An  algebra. 

The  noetherian  ordering  on  inputs  to  CR-unify  used  to  show  termination.  -<£  is 
the  lexicographic  extension  of  the  cardinalities  of  r  and  r. 

The  domain  of  a  substitution,  i.e.,  3(a)  =  { v|a  v  *  v}. 

Used  to  denote  the  ordering  on  substitutions,  «p1  <  <p2  if  and  only  if  there  exists  <j>3 
such  that  <p3°9t  =  q>2- 


The  congruence  relation  on  terms  defined  by  an  equational  theory  of  E.  Also  used 
to  denote  the  congruence  relation  extended  to  substitutions. 

The  equational  theory  presented  by  the  set  of  axioms,  E.  I.e.,  E  denotes  any  set  of 
equations  whereas  E*  denotes  a  closed  set. 

The  set  of  equations  valid  in  all  models  of  the  class  M. 

The  universe  of  function  symbols,  i.e.,  the  signature  of  the  entire  theory  E*. 

The  set  of  function  symbols  in  f 

A  substitution,  usually  used  to  denote  the  preserving  substitution  of  a  term  and  its 
homogeneous  form  or  the  combined  preserving  substitution  for  a  pair  of  terms. 

The  set  of  all  ground  terms  formable  from  F. 

The  empty  substitution. 

The  function  symbol  at  the  head  (or  root)  of  t. 

The  empty  occurrence,  i.e.,  the  empty  string.  For  any  term  t/e  ■  f. 

The  set  of  all  variables  in  the  range  of  a  substitution,  i.e.,  3(a)  =  {v|  v€T(0  for 
/€*(a)}. 

An  ordering  on  pairs  of  terms  defined  in  [Kirchner  85]  that  is  used  in  defining  a 
strict  equational  theory.  One  pair  is  less  than  another  if  a  term  in  the  first  pair  is  a 
variable,  and  that  variable  occurs  in  a  non-variahle  term  in  the  second  pair. 


The  set  of  all  models  of  E. 


The  universal  E-preserving  substitution. 


M 

v  An  ^-assignment,  or  interpretation,  mapping  terms  to  objects  in  an  algebra. 

n  The  null  operator,  used  to  denote  the  parent  of  the  a  term  in  itself. 

v(t,  s )  The  set  of  variables  in  t  and  s  that  occur  under  more  than  one  equivalence  class 

of  parent  operators.  The  cardinality  of  v  is  denoted  by  ».  (r  is  in  no  way  related  to 
the  use  of  v  as  an  U-assignment.) 

U  The  set  of  special  variables  used  to  denoted  congruence  classes  of  terms. 

0(f)  The  set  of  occurrences  in  t 

u  A  substitution,  used  in  this  thesis  to  denote  factors  of  a  unifier  as  it  is  being  built  in 

CR-unify. 

Parentsft,  s)  The  set  of  parent  operators  of  f  in  s. 

ParSets(t,  s)  The  equivalence  classes  of  parent  operator  of  f  in  s. 

v  The  partitioned  presentation,  usually  assumed  to  present  the  theory  E*. 

=  The  equivalence  relation  on  function  symbols  defined  by  the  partitioned  presen- 

*  tation,  v. 

<p  A  substitution. 

p  A  substitution,  used  in  this  thesis  to  denote  a  sub-theory  unifier  of  two 

homogeneous  terms. 

a  A  substitution,  used  the  CR-unify  to  denote  a  unifier  or,  when  subscripted,  a  par¬ 

tially  formed  unifier. 

a  The  terms  in  the  range  of  a  substitution,  i.e.,  9>(o)  =  {ap|v£9(u)}. 

S  The  set  of  aft  possible  substitutions. 

r(f,  s)  The  set  of  significant  subterms  of  f  and  s.  The  cardinality  of  r  is  denoted  by  r. 

T  =  T(F,  V)  The  set  of  all  terms  formable  from  F  and  V. 

% '/  =  The  quotient  algebra  of  the  term  algebra  modulo  and  equation  theory  congruence 

E  relation  on  terms. 

V  The  universe  of  variables. 

T[t)  The  set  of  variables  in  t. 

[v  Restricts  the  domain  of  a  substitution:  a|v  =  {v«-av  |  v€V).  Also  used  to  restrict 

sets  of  substitutions:  Z|v  =  {ofv  |  e€Z). 
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Functional  composition.  For  substitutions  1  and  y2,  <p f°<p2t  =  f°r  any 

term  t. 

The  function  symbols  at  the  head  of  a  term  t. 

Validity. 

Set  of  all  E-umfiers  of  two  terms. 

Complete  set  of  unifiers. 

Minimal  and  complete  set  of  unifiers 

The  equivalence  class  of  function  symbols  (defined  by  v)  that  contains /. 

The  homogeneous  form  of  t.  Homogft ,  [t.head]).  Homogenization  is  done  with 
respect  to  the  set  of  relevant  function  symbols  for  some  sub  theory  of  E*  such 
that  the  head  of  t  is  in  the  set.  I.e.,  it  is  the  maximum  homogeneous  term  at  the  top 
of  t  where  new  variables  take  the  place  of  subterms  outside  the  homogeneous 
part. 

The  homogeneous  form  of  t  which  is  similar  to  F  except  subterms  are  replaced 
with  elements  of  the  special  set  U  rather  than  new  variables. 

The  subterm  ordering  on  terms.  I.e.,  t-<s  if  and  only  if  t  is  a  subterm  of  s.  f-<s  may 
be  used  if  t  is  a  proper  subterm  of  s. 

The  significant  subterm  ordering,  i.e.,  if  and  only  if  t  is  a  subterm  of  s  and  t  is 
significant  in  s.  If  t  is  also  know  to  be  proper  in  s,  f-<ys  may  be  used. 
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